Abstract

Properties of Boolean functions on the hypercube that are invariant with respect to linear transforma- 
tions of the domain are among some of the most well-studied properties in the context of property testing. 
In this paper, we study the fundamental class of linear-invariant properties called matroid freeness prop- 
erties. These properties have been conjectured to essentially coincide with all testable linear-invariant 
properties, and a recent sequence of works has established testability for increasingly larger subclasses 
of matroid freeness properties. One question that has been left open, however, is whether the infinitely 
many syntactically different matroid freeness properties recently shown to be testable in fact correspond 
to new, semantically distinct properties. This is a crucial issue since it has also been shown previously 
that there exist subclasses of matroid freeness properties for which an infinite set of syntactically dif- 
ferent representations collapse into one of a small, finite set of properties, all previously known to be 
testable. 

An important question is therefore to understand the semantics of matroid freeness properties, and 
in particular when two syntactically different properties are truly distinct. We shed light on this problem 
by developing a method for determining the relation between two matroid freeness properties $\mathcal{P}$ and $\mathcal{Q}$.
Furthermore, we show that there is a natural subclass of matroid freeness properties such that for any
two properties $\mathcal{P}$ and $\mathcal{Q}$ from this subclass, a strong dichotomy must hold: either $\mathcal{P}$ is contained in $\mathcal{Q}$ or
the two properties are "well separated" from one another. As an application of this method, we exhibit
new, infinite hierarchies of testable matroid freeness properties such that at each level of the hierarchy, 
there are explicit functions that are far in Hamming distance from all functions lying in the lower levels 
of the hierarchy. Our key technical tool is an apparently new notion of maps between linear matroids, 
which we call labeled matroid homomorphisms, that might be of independent interest. 

1 Introduction 

The field of property testing, as initiated by [BLR93, BFL91] and defined formally by [RS96, GGR98], asks 
if, for a given property, there exists an algorithm which queries an input object a small number of times and 
decides correctly with high probability whether the object has the property or whether it is "far away" from 
the property. The property is called testable, or sometimes strongly testable or locally testable, if the number 
of queries can be made independent of the size of the object without affecting the correctness probability. 
Since such a tester receives only constantly many bits of information about the input object, a prerequisite 
for testability is that there be quickly detectable local obstructions to the property whenever the input object 
is far from satisfying it. Perhaps quite surprisingly, it has been found that a large number of different
natural properties satisfy this strong requirement and indeed admit property testers (see, for instance, the 
recent surveys [Ron08, Ron09] for more information). This raises the question of what lies behind all of
these testability results, and whether we can gain an understanding of some common underlying traits by 
investigating the space of testable properties. 

1.1 Linear Invariance and Matroid Freeness Properties 

One particular class of properties of interest is the set of linear-invariant properties of functions on the
Boolean hypercube. We refer to the recent survey by Sudan [Sud10] for an in-depth discussion of linear
invariance and more generally of the relation between invariance and property testing. Briefly, a
linear-invariant property $\mathcal{F}$ is a collection of functions $\mathcal{F} \subseteq \bigcup_{n \geq 1} \{f : \mathbb{F}^n \to \mathcal{R}\}$, where $\mathbb{F}$ is a finite field and $\mathcal{R}$ is some
finite range, such that if $f$ is in $\mathcal{F}$, then $f \circ L$ is also in $\mathcal{F}$ for every $\mathbb{F}$-linear map $L : \mathbb{F}^n \to \mathbb{F}^n$, where $\circ$
denotes function composition $(f \circ L)(x) = f(L(x))$. In this work, we will mostly restrict ourselves to the
most commonly studied case $\mathbb{F} = \mathbb{F}_2$ and $\mathcal{R} = \{0, 1\}$.

As first explicitly pointed out by Kaufman and Sudan [KS08], a wide range of natural algebraic proper- 
ties, whose testability had been previously studied as special cases, are linear-invariant. Examples include 
such landmark results in the literature as the testability of linear functions [BLR93], Reed-Muller codes 
[BFL91, RS96, JPRZ04, AKK+05], and BCH codes [KL06]. In view of this, the goal of a sequence of re- 
cent papers [BCSX09, KSV10, KSV09, Sha09, BGS10, BCSX10] has been to try to explain these previous 
results in a uniform way by providing a general understanding of necessary and sufficient conditions for 
testability of linear-invariant properties. The focus of these works has been on so-called matroid freeness 
properties,$^1$ as defined next.

Definition 1.1 (Matroid freeness). Given integers $k, r \geq 1$, a set $M = \{\mathbf{v}_1, \ldots, \mathbf{v}_k\}$ of $k$ vectors in $\mathbb{F}^r$,
and a string $\sigma \in \mathcal{R}^k$, we say that a function $f : \mathbb{F}^n \to \mathcal{R}$ is $(M, \sigma)$-free if there does not exist any linear
map $L : \mathbb{F}^r \to \mathbb{F}^n$ such that $f(L(\mathbf{v}_i)) = \sigma_i$ for all $i \in [k]$. Otherwise, if such an $L$ exists, we say $f$ contains
$(M, \sigma)$ at $L$.
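For very small parameters, Definition 1.1 can be checked by exhaustive search. The following is a minimal sketch over $\mathbb{F}_2$ with $\mathcal{R} = \{0,1\}$, assuming vectors are encoded as integer bitmasks and a linear map $L$ is given by the images of the unit vectors. The helper names (`apply_linear`, `find_containment`, `is_free`) and the triangle example are illustrative choices, not notation from the paper.

```python
from itertools import product

def apply_linear(images, v):
    """Apply the F_2-linear map that sends e_i to images[i]; vectors are bitmasks."""
    out = 0
    for i, img in enumerate(images):
        if (v >> i) & 1:
            out ^= img
    return out

def find_containment(f, n, M, r, sigma):
    """Return the images of e_1..e_r under some L at which f contains (M, sigma), else None."""
    for images in product(range(2 ** n), repeat=r):
        if all(f(apply_linear(images, v)) == s for v, s in zip(M, sigma)):
            return images
    return None

def is_free(f, n, M, r, sigma):
    """True if f : F_2^n -> {0,1} is (M, sigma)-free (brute force over all 2^(n*r) maps)."""
    return find_containment(f, n, M, r, sigma) is None

# Hypothetical example: the triangle {e1, e2, e1 + e2} in F_2^2.
TRIANGLE = [0b01, 0b10, 0b11]

def parity(x):
    """The linear function x_1 + ... + x_n over F_2."""
    return bin(x).count("1") % 2

def constant_one(x):
    return 1
```

Under these assumptions, the parity function is (triangle, 111)-free because $f(x) = f(y) = 1$ forces $f(x + y) = 0$, whereas the constant-one function contains the pattern already at the zero map.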

Observe that for the purposes of this definition, the exact identity of the elements of M is not important; 
it is only important to know about the linear dependencies between them. Therefore, it is convenient to 
consider $M$ abstractly as a linear matroid.$^2$ With this in view, we refer to the pair $(M, \sigma)$ in the above
definition as a labeled matroid or a matroid constraint.

Definition 1.1 can be generalized so that we have a collection of matroid constraints, instead of just one 
as above. 

Definition 1.2. Given a (possibly infinite) collection $\mathcal{M} = \{(M^1, \sigma^1), (M^2, \sigma^2), \ldots\}$ of matroid
constraints, a function $f : \mathbb{F}^n \to \mathcal{R}$ is said to be $\mathcal{M}$-free if it is $(M^i, \sigma^i)$-free for all $i$.

The class of $\mathcal{M}$-freeness properties has turned out to be very convenient for a general analysis of the
testability of linear-invariant properties. Work along these lines was initiated by [BCSX09], who showed
that $(M, \sigma)$-freeness for functions $f : \mathbb{F}_2^n \to \{0, 1\}$ is testable for so-called graphic matroids $M$ (as defined
in Section 2), provided that $\sigma$ is the all-ones string, henceforth denoted $\sigma = 1^*$. It is easy to verify that
such a pattern corresponds to a monotone property, i.e., that if $f$ is $(M, 1^*)$-free and $f'$ is obtained from
$f$ by flipping $f(x)$ from 1 to 0 at any point $x \in \mathbb{F}_2^n$, then $f'$ is also $(M, 1^*)$-free. The papers [KSV10]

$^1$ It should be noted that, strictly speaking, the properties in [BGS10, KSV10, Sha09] are described in terms of forbidding
solutions to systems of linear equations rather than requiring the absence of matroid patterns. These two formulations are essentially
equivalent, however, as explained in Appendix A.

$^2$ The formal definition of a matroid is not too important in this context, so although we provide a definition in Appendix A for
completeness, in the rest of this paper the reader can just think of a matroid as a set of elements in a vector space over $\mathbb{F}$.
and [Sha09] independently showed that the restriction to graphic matroids could be dropped, but this again
requires that $\sigma = 1^*$. More recently, [BGS10] showed that if $\mathcal{M} = \{(M^1, \sigma^1), (M^2, \sigma^2), \ldots\}$ is a possibly
infinite collection of matroid constraints such that each $M^i$ is a graphic matroid, but where $\sigma^i$ is no longer
necessarily $1^*$, then $\mathcal{M}$-freeness is testable. Furthermore, [BGS10] conjectured that the class of properties
captured in Definition 1.2 above is exactly the class of linear-invariant properties testable with one-sided
testers. In another work [BCSX10], it was conjectured that properties testable by so-called
proximity-oblivious testers, a notion introduced in [GR09], are exactly the class of $\mathcal{M}$-freeness properties with finite
$\mathcal{M}$. The important role of the $(M, \sigma)$-freeness properties as the building blocks of potentially all testable
linear-invariant properties thus provides a strong motivation to study their structure more carefully.

Even though matroid constraints seem to arise naturally when characterizing the testability of
linear-invariant properties, there is potentially a very problematic loophole in claims such as the one above from
[BGS10]. The issue is that although the claim shows the testability of $\mathcal{M}$-freeness for infinitely many matroid
constraint collections $\mathcal{M}$, it is not at all clear whether these infinitely many $\mathcal{M}$'s characterize infinitely many
distinct properties. Specifically, it could well be the case that the property of $(M^1, \sigma^1)$-freeness is identical
to $(M^2, \sigma^2)$-freeness for two different matroid constraints $(M^1, \sigma^1)$ and $(M^2, \sigma^2)$. A little more subtly, it
could also be the case that for distinct constraints $(M^1, \sigma^1)$ and $(M^2, \sigma^2)$, any function which is
$(M^1, \sigma^1)$-free is very close to being $(M^2, \sigma^2)$-free, so that even though the properties are not identical, any one-sided
tester for $(M^1, \sigma^1)$-freeness can be easily modified to be a tester for $(M^2, \sigma^2)$-freeness. A third pitfall is
that it could be the case that for three distinct matroid constraints $(M^1, \sigma^1)$, $(M^2, \sigma^2)$ and $(M^3, \sigma^3)$, the
property of $(M^3, \sigma^3)$-freeness is (or is very close to) the union of the properties $(M^1, \sigma^1)$-freeness and
$(M^2, \sigma^2)$-freeness. In this case, testability of $(M^3, \sigma^3)$-freeness is trivially guaranteed by the testability of
$(M^1, \sigma^1)$-freeness and $(M^2, \sigma^2)$-freeness. Thus, for the above cited result of [BGS10] to be nontrivial, one
should ensure that the properties covered in that result are not the union of properties already previously
known to be testable.

It should be stressed that these concerns are far from hypothetical. It was shown in [BCSX09] that
if $M$ is the graphic matroid on the $k$-cycle for any $k$, then while there is an infinite hierarchy of distinct
properties when $\sigma = 1^k$, it turns out that $(M, \sigma)$-freeness properties when $\sigma \neq 1^k$ always degenerate to one of
a finite set of properties that have already been known to be testable since the work of [BLR93]. It is a
natural question to ask whether it could be the case more generally that all non-monotone matroid freeness
properties degenerate to one of a small set of already well-studied properties. This is posed as an open
problem in [BCSX09], and resolving this question is the main motivation behind this work.

1.2 Summary of Our Results

Very briefly, given two matroid constraints $(M, \sigma)$ and $(N, \tau)$, we establish necessary and sufficient
conditions for when the two properties $(M, \sigma)$-freeness and $(N, \tau)$-freeness are identical or distinct, provided
that the constraints satisfy certain structural conditions. We then go on to show the existence of matroid
constraints that satisfy these conditions. Finally, we use these results to rule out the aforementioned
objections about testability results for matroid freeness properties by exhibiting infinite hierarchies of distinct
non-monotone and testable matroid freeness properties. We now describe these results in some more detail.

The main tool we use to show separations between matroid freeness properties is the notion of a
labeled matroid homomorphism. Just as the notion of graph homomorphism and its variants are helpful
in counting occurrences of (induced) subgraphs inside graphs (see [AS06] for a survey), labeled matroid
homomorphisms allow us to count the number of times a given matroid constraint is contained in a
function. More precisely, we define a labeled matroid homomorphism $\phi$ from a matroid constraint $(M, \sigma)$ to a
matroid constraint $(N, \tau)$ to be a map $\phi$ that (i) is linear, (ii) maps elements of $M$ to elements of $N$, and
(iii) preserves labels in the sense that the $\sigma$-label of any element $\mathbf{v}$ in $M$ equals the $\tau$-label of $\mathbf{w} = \phi(\mathbf{v})$
in $N$. Observe that since $\phi$ is linear, if some elements of $M$ are linearly dependent, then their images
in $N$ are also linearly dependent.

It is not hard to show that if there is a labeled matroid homomorphism from $(M, \sigma)$ to $(N, \tau)$, then any
$(M, \sigma)$-free function is also $(N, \tau)$-free. It is reasonable to wonder whether the fact that $(M, \sigma)$ does not
map homomorphically into $(N, \tau)$ can also provide some information about the relationship between the
two properties of $(M, \sigma)$-freeness and $(N, \tau)$-freeness. If we are optimistically inclined, we might even
ask whether the existence or non-existence of homomorphisms exactly determines the relationship between
these two properties in the sense that $(M, \sigma)$-freeness is far in Hamming distance from being contained in
$(N, \tau)$-freeness in this latter case. Somewhat surprisingly, it turns out that this is in fact true for monotone
matroid freeness properties.

Theorem 1.3 (First main theorem (informal)). For any linear labeled matroids $(M, 1^*)$ and $(N, 1^*)$ it
holds that either $(M, 1^*)$-freeness is contained in $(N, 1^*)$-freeness or $(M, 1^*)$-freeness is "well separated"
from $(N, 1^*)$-freeness in the sense that there is a function that is $(M, 1^*)$-free but far from being
$(N, 1^*)$-free. The first case applies if there exists a labeled matroid homomorphism from $(M, 1^*)$ to $(N, 1^*)$, and the
second case applies otherwise.

Notice that this implies a strong dichotomy: one of the two cases in Theorem 1.3 must hold, and it can 
never be the case that the two properties are distinct but close in a property testing sense. 

It should be noted that some limited results of the same flavor were proven for specific graphic matroids 
and monotone patterns in [BCSX09], and our techniques are inspired by that paper. However, our results 
are much stronger in that they apply to arbitrary (non-graphic) linear matroids and provide an exact criterion 
for when two matroid constraints with monotone patterns are distinct. 

An obvious next question is whether this dichotomy, and the characterization in terms of labeled matroid 
homomorphisms, holds not only for monotone properties but for matroid freeness properties in general. 
Extending our methods to non-monotone properties presents significant technical hurdles, and indeed the 
general question remains a challenging open problem. However, we are able to identify two special cases 
when an exact characterization in terms of homomorphisms still applies. We restrict ourselves in the theorem 
statement below to graphic matroid freeness properties, which are of special interest since they were the ones 
shown to be testable in [BGS10],$^3$ although our results hold in slightly larger generality and in particular
extend even to properties not currently known to be testable. 

Theorem 1.4 (Second main theorem (informal)). If $(M, \sigma)$ and $(N, \tau)$ are graphic matroids that satisfy
certain structural conditions but where $\sigma$ and $\tau$ can be non-monotone patterns, then it holds that either
$(M, \sigma)$-freeness is contained in $(N, \tau)$-freeness or it is "well separated" from it, and this is exactly
determined by the existence or non-existence of a labeled matroid homomorphism from $(M, \sigma)$ to $(N, \tau)$.

Our focus in Theorem 1.4 is on matroids over complete graphs $K_d$, which in a sense are the building
blocks of all labeled graphic matroids (again, we refer to Section 2 for a more formal discussion). Our
technical contribution lies in using the structure of the complete graph to argue that if $(M, \sigma)$ does not
embed homomorphically into $(N, \tau)$, where $M, N$ are graphic submatroids of the complete graph and $\sigma, \tau$
have some specific structure, then it is possible to pack into a function many copies of $(N, \tau)$, i.e., many
violations of $(N, \tau)$-freeness, while still keeping the function $(M, \sigma)$-free.

Finally, we apply these two dichotomy results to rule out the concerns discussed above about degeneracy 
of matroid freeness properties. 

Theorem 1.5 (Third main theorem (informal)). There are infinite hierarchies of testable, non-monotone 
and monotone matroid freeness properties such that for each hierarchy, consecutive properties in the
hierarchy are well separated. Furthermore, it is not the case that any one of the properties equals (or is close

$^3$ In fact, [BGS10] shows testability for a strictly larger class of so-called complexity-1 matroids, but we do not want to get into
technicalities here.
to) the union of some subset of the other properties, and the properties provably cannot simply correspond
to e.g. the well-studied class of low-degree polynomials. 

Thus, in particular, Theorem 1.5 allows us to conclude that the properties shown to be testable in
[BGS10] do indeed constitute a new, infinitely large class of testable linear-invariant properties.

1.3 Organization of This Paper 

In Section 2, we provide some necessary background and elaborate in more detail on the motivation behind
this work. Section 3 is devoted to establishing the dichotomy theorems. In Section 4, we prove the
non-existence of labeled matroid homomorphisms, which can then be combined with the results from Section 3 to
establish the infinite hierarchies of testable non-monotone properties in Section 5. We conclude in Section 6
by discussing some of the intriguing questions left open by our work. Some background material, which
might be useful although not necessary to understand the rest of the paper, is presented in Appendix A for 
completeness. 

2 Preliminaries and Motivation 

Let $\mathbb{N} = \{0, 1, \ldots\}$ denote the set of natural numbers and let $\mathbb{N}^+ = \mathbb{N} \setminus \{0\}$. Let $n \geq 1$ be a natural number.
We write $[n]$ to denote the set $\{1, 2, \ldots, n\}$. We write $\mathbb{F}$ to denote a (finite) field.

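Formally speaking, a property $\mathcal{P}$ is a subset of functions from domain(s) $\mathcal{D}_n$ to range(s) $\mathcal{R}_n$, that is,
$\mathcal{P} = \bigcup_{n \in \mathbb{N}^+} \mathcal{P}_n$ where $\mathcal{P}_n \subseteq \{\mathcal{D}_n \to \mathcal{R}_n\}$, but it is customary to suppress $n$ in this notation.
Throughout this paper, we will have $\mathcal{D}_n = \mathbb{F}^n$ and $\mathcal{R}_n = \mathcal{R}$ for some fixed $\mathcal{R}$, usually $\mathcal{R} = \{0, 1\}$.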

Let $f, g : \mathcal{D} \to \mathcal{R}$ be two functions defined over the same domain $\mathcal{D}$. The (relative) distance between
functions $f$ and $g$, denoted $\mathrm{dist}(f, g)$, is the probability $\Pr_{x \in \mathcal{D}}[f(x) \neq g(x)]$ that they differ on some $x$
drawn uniformly at random from $\mathcal{D}$. The distance between a function $f$ and a property $\mathcal{P}$ is
$\mathrm{dist}(f, \mathcal{P}) = \min_{g \in \mathcal{P}} \{\mathrm{dist}(f, g)\}$. We say that $f$ is $\delta$-far from $\mathcal{P}$ if $\mathrm{dist}(f, \mathcal{P}) > \delta$ and $\delta$-close otherwise. The following
two definitions capture the notion of two properties being "well separated" from each other in a property
testing sense.

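Definition 2.1 ($\delta$-separated). For two properties $\mathcal{P}, \mathcal{Q} \subseteq \bigcup_{n \in \mathbb{N}^+} \{\mathbb{F}^n \to \mathcal{R}\}$, we say that $\mathcal{Q}$ is $\delta$-separated
from $\mathcal{P}$ if for infinitely many $n$ there are functions $f_n : \mathbb{F}^n \to \mathcal{R}$ that are in $\mathcal{Q}$ but are $\delta$-far from being in $\mathcal{P}$
(where we note that $\delta > 0$ is fixed and in particular independent of $n$).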

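Definition 2.2 ($\delta$-strictly contained). For two properties $\mathcal{P}, \mathcal{Q} \subseteq \bigcup_{n \in \mathbb{N}^+} \{\mathbb{F}^n \to \mathcal{R}\}$, we say that $\mathcal{P}$ is
$\delta$-strictly contained in $\mathcal{Q}$ if $\mathcal{P} \subseteq \mathcal{Q}$ but $\mathcal{Q}$ is $\delta$-separated from $\mathcal{P}$.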
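As a small illustration of the distance notions used in these definitions, the following sketch computes $\mathrm{dist}(f, g)$ and $\mathrm{dist}(f, \mathcal{P})$ exactly for functions on $\mathbb{F}_2^n$ with tiny $n$. It assumes points are encoded as integers in $\{0, \ldots, 2^n - 1\}$ and that the property is given as an explicit finite list of functions; the choice of the linear functions as the example property and all function names are hypothetical.

```python
from fractions import Fraction

def dist(f, g, n):
    """Fraction of points x in F_2^n (encoded as integers) on which f and g differ."""
    differing = sum(1 for x in range(2 ** n) if f(x) != g(x))
    return Fraction(differing, 2 ** n)

def dist_to_property(f, members, n):
    """Distance from f to a property given as an explicit, non-empty list of functions."""
    return min(dist(f, g, n) for g in members)

def linear_functions(n):
    """All functions x -> <a, x> over F_2, one for each a in F_2^n."""
    return [lambda x, a=a: bin(x & a).count("1") % 2 for a in range(2 ** n)]

def and_of_first_two_bits(x):
    """The non-linear function x_1 * x_2."""
    return (x & 1) & ((x >> 1) & 1)

distance = dist_to_property(and_of_first_two_bits, linear_functions(2), 2)
```

In this toy instance the AND function is at distance 1/4 from the linear functions on $\mathbb{F}_2^2$, so it is $\delta$-far from linearity for every $\delta < 1/4$.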

In this work, we will not be too concerned with actual property testing, focusing instead on
understanding the semantics of (syntactic) properties already shown to be testable by other means. In order for the
discussion in this paper to be self-contained, however, we recall that a tester for a property $\mathcal{P}$ is a
probabilistic algorithm which is given a distance parameter $\delta$ and has oracle access to an input function $f : \mathcal{D} \to \mathcal{R}$.
The tester should accept with high probability, say at least 2/3, if $f \in \mathcal{P}$ and should reject, also with
probability at least 2/3, if the function is $\delta$-far from $\mathcal{P}$. The tester is said to be one-sided if it has no false
negatives, i.e., if functions in the property are always accepted with probability 1. The central parameter
associated with a tester is the number of oracle queries it makes to the function $f$ being tested. In particular,
a property is called testable (or locally testable) if there is a tester with query complexity that depends only
on the distance parameter $\delta$ and is independent of the size of the domain $\mathcal{D}$.

Recall that a property $\mathcal{P} \subseteq \bigcup_{n \in \mathbb{N}^+} \{\mathbb{F}^n \to \mathcal{R}\}$ is said to be linear-invariant if $f \in \mathcal{P}$ implies that
$f \circ L \in \mathcal{P}$ for every linear transformation $L : \mathbb{F}^n \to \mathbb{F}^n$. This notion should not be confused with the
well-studied property of being linear, i.e., that for $f, g : \mathbb{F}^n \to \mathbb{F}$ it holds that if $f \in \mathcal{P}$ and $g \in \mathcal{P}$, then
it must also be the case that $f + g \in \mathcal{P}$. Note that a linear-invariant property need not be linear. Indeed, in
general this will not be the case, and in our setting we will not impose any algebraic structure on the range
$\mathcal{R}$ of the functions.

Turning next to matroids, for the purposes of this paper the reader can think of a linear matroid $M$
as a set of vectors $\{\mathbf{v}_1, \ldots, \mathbf{v}_k\}$ in $\mathbb{F}^{k'}$ for $k' \leq k$. We will often write $N$ to denote some other matroid
$\{\mathbf{w}_1, \ldots, \mathbf{w}_m\}$ over $\mathbb{F}^{m'}$ for $m' \leq m$, and unless otherwise stated $\mathbf{v}_i$ is assumed to denote a vector in $M$
and $\mathbf{w}_j$ to denote a vector in $N$. We let $\mathbf{e}_1, \mathbf{e}_2, \ldots$ denote the unit vectors in the ambient space, i.e., $\mathbf{e}_i$ is
1 at coordinate $i$ and 0 everywhere else. Sometimes when we need to distinguish basis vectors of different
matroids we will also write $\mathbf{f}_1, \mathbf{f}_2, \ldots$ to denote unit vectors (in some other space). The weight $|\mathbf{v}|$ of a vector
$\mathbf{v}$ is the number of non-zero coordinates in $\mathbf{v}$. We write $\sigma = (\sigma_1, \ldots, \sigma_k) \in \mathcal{R}^k$ and $\tau = (\tau_1, \ldots, \tau_m) \in \mathcal{R}^m$
to denote strings or patterns corresponding to the matroids $M$ and $N$ respectively. We let $f$ and $g$ denote
functions $\mathbb{F}^n \to \mathcal{R}$, where we will often, but not always, have $\mathbb{F} = \mathbb{F}_2 = \mathrm{GF}(2)$ and $\mathcal{R} = \{0, 1\}$. Note that
$k, k', m, m'$ are all fixed while we think of $n$ as going to infinity.

A matroid $M = \{\mathbf{v}_1, \ldots, \mathbf{v}_k\}$ is said to be graphic if there exists a graph $G$ with $k$ edges for which
these edges can be associated with the vectors $\mathbf{v}_1, \ldots, \mathbf{v}_k$ in $M$ in such a way that any subset of vectors
$S \subseteq \{\mathbf{v}_1, \ldots, \mathbf{v}_k\}$ is linearly dependent if and only if the associated set of edges contains a cycle. In this
case, we denote $M$ by $M(G)$. (Also notice that we write $\mathbf{v}$ and $\mathbf{e}$ for vectors to distinguish them from
vertices $v$ and edges $e$ in graphs.) We require the graph $G$ to be simple; that is, $G$ has no self-loops or
parallel edges. It is a well-known fact that graphic matroids can always be represented as binary matroids,
i.e., linear matroids over $\mathbb{F}_2$.

A central notion of this work will be that of a matroid homomorphism. 

Definition 2.3 (Matroid homomorphism [BCSX09]). Let $M = \{\mathbf{v}_1, \ldots, \mathbf{v}_k\}$ and $N = \{\mathbf{w}_1, \ldots, \mathbf{w}_m\}$ be
two matroids with $M \subseteq \mathbb{F}^{k'}$ and $N \subseteq \mathbb{F}^{m'}$. A matroid homomorphism $\phi : M \to N$ is an $\mathbb{F}$-linear map from
$\mathbb{F}^{k'}$ to $\mathbb{F}^{m'}$ such that $\phi(\mathbf{v}_i) \in \{\mathbf{w}_1, \ldots, \mathbf{w}_m\}$ for every $1 \leq i \leq k$. We will also say that $\phi$ is an embedding
of the matroid $M$ into the matroid $N$, or that $M$ embeds into $N$.

In contrast to [BCSX09], we want to be able to study not only monotone but also non-monotone matroid
freeness properties, i.e., properties characterized by matroid constraints $(M, \sigma)$ where we can have
$\sigma \notin \{0^k, 1^k\}$. In order to do so, we need the following generalization of Definition 2.3 to arbitrary matroid
constraints.

Definition 2.4 (Labeled matroid homomorphism). A labeled matroid homomorphism $\phi : (M, \sigma) \to (N, \tau)$
is a matroid homomorphism from $M$ to $N$ which in addition preserves labels in the sense that if
$\phi(\mathbf{v}_i) = \mathbf{w}_j$, then $\sigma_i = \tau_j$. If there exists a labeled homomorphism from $(M, \sigma)$ to $(N, \tau)$, we say that
$(M, \sigma)$ embeds into $(N, \tau)$ and write $(M, \sigma) \hookrightarrow (N, \tau)$; otherwise, we write $(M, \sigma) \not\hookrightarrow (N, \tau)$.
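For small binary matroids, the existence of a labeled matroid homomorphism in the sense of Definition 2.4 can be decided by brute force over all linear maps. The sketch below assumes $\mathbb{F} = \mathbb{F}_2$, vectors encoded as integer bitmasks, distinct vectors in $N$, and a map given by the images of the unit vectors. The function names and the cycle matroids `C3` and `C4` are illustrative choices rather than examples taken from the paper.

```python
from itertools import product

def apply_linear(images, v):
    """Apply the F_2-linear map that sends e_i to images[i]; vectors are bitmasks."""
    out = 0
    for i, img in enumerate(images):
        if (v >> i) & 1:
            out ^= img
    return out

def find_labeled_hom(M, sigma, k_dim, N, tau, m_dim):
    """Return images of e_1..e_{k_dim} under a labeled homomorphism (M, sigma) -> (N, tau), or None."""
    label_in_N = dict(zip(N, tau))  # assumes the vectors of N are distinct
    for images in product(range(2 ** m_dim), repeat=k_dim):
        # Every v_i must land on some w_j in N carrying the same label.
        if all(label_in_N.get(apply_linear(images, v)) == s for v, s in zip(M, sigma)):
            return images
    return None

# Graphic matroids of the 3-cycle and the 4-cycle over F_2 (hypothetical examples).
C3 = [0b01, 0b10, 0b11]
C4 = [0b001, 0b010, 0b100, 0b111]
```

With these inputs, `find_labeled_hom(C4, (1, 1, 1, 1), 3, C3, (1, 1, 1), 2)` finds a map, while no homomorphism exists in the opposite direction, since no two elements of `C4` sum to a third.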

Let us now fix $\mathcal{R} = \{0, 1\}$ for the rest of this section. We can visualize a graphic matroid constraint
$(M(G), \sigma)$ as the graph $G$ with 0/1-labels $\sigma_i$ on its edges. In what follows, we will sometimes identify
$(M(G), \sigma)$ with this labeled graph $(G, \sigma)$. We say that $(G, \sigma)$ is a labeled subgraph of $(H, \tau)$ if $G$ is a
subgraph of $H$ such that the edge labels of the common edges coincide. For graphic matroid constraints
$(M(G), \sigma)$ and $(M(H), \tau)$, a labeled matroid homomorphism is simply a mapping of edges in $G$ to edges
with the same labels in $H$ such that cycles in $G$ map to cycles in $H$. Clearly, if $(G, \sigma)$ is a labeled subgraph
of $(H, \tau)$, then the embedding of the vertices of $G$ in $H$ induces a matroid homomorphism in the natural way.
These are not the only labeled graphic homomorphisms, however. In particular, a matroid homomorphism
need not map edges incident to the same vertex in $G$ to incident edges in $H$.

As long as we are only studying monotone patterns, we need not worry too much about exactly how our
linear matroids are represented. When we want to discuss $(M, \sigma)$-freeness for a non-monotone pattern $\sigma$,
however, we have to specify exactly how $M$ is represented in order to make sure that the matroid constraint
is well-defined.

Definition 2.5 (Standard binary matroid representation). We say that a binary matroid $M = \{\mathbf{v}_1, \ldots, \mathbf{v}_k\}$
is in standard representation if there is a $d \leq k$ such that $\mathbf{v}_i = \mathbf{e}_i$ for $i = 1, \ldots, d$ and the rest of the vectors
are linear combinations of these vectors, i.e., for all $j > d$ we have $\mathbf{v}_j = \sum_{i \in I_j} \mathbf{e}_i$ for index sets $I_j \subseteq [d]$,
where $\mathbf{v}_j$, $j > d$, are enumerated in lexicographical order with respect to $I_j$.

It is easy to see that any matroid has a standard linear representation, namely by associating unit vectors 
to a basis of the matroid and then representing the other elements by appropriate linear combinations of the 
basis elements. When we talk about matroid constraints from now on, it will always be for linear matroids 
in standard representation. 

We remark that the standard representation does not uniquely specify matroid constraints $(M, \sigma)$:
for a fixed $M$ in standard representation there can be several distinct patterns $\sigma^1, \sigma^2, \ldots$ such that $(M, \sigma^i)$
all represent the same labeled matroid. For instance, for the complete graph $K_d$ on $d$ vertices, all labeled
matroids $(M(K_d), 1^{d_1} 0 1^{d_2})$ are easily verified to be the same for all $d_1 + d_2 = \binom{d}{2} - 1$. However, it is
not the case that for all $d_1 + d_2 + d_3 = \binom{d}{2} - 2$ the labeled matroids $(M(K_d), 1^{d_1} 0 1^{d_2} 0 1^{d_3})$ are all the
same. On the contrary, one can prove (although we do not do so here) that one gets two different cases
depending on whether the two 0-labeled edges are incident to a common vertex or not. The point is that the
standard representation of $M$ gives us at least one well-defined description of the labeled matroid, whereas
the example just discussed shows that without such a representation a non-monotone labeled matroid $(M, \sigma)$
could be ambiguous.

In this work, we will focus on a particular class of matroid freeness properties, the understanding of 
which is arguably fundamental to the broader study of the space of testable linear-invariant properties. Let 
us explain what we mean by this. 

It was shown in [BGS10] that any linear-invariant property in $\mathbb{F}_2^n \to \{0, 1\}$ that is testable with a
one-sided tester can be written as an $\mathcal{M}$-freeness property for a possibly infinite collection $\mathcal{M}$ of matroid
constraints.$^4$ In other words, any such property can be characterized as the intersection of a possibly
infinite number of $(M, \sigma)$-freeness properties. Moreover, each $(M, \sigma)$-freeness property can be written as the
intersection of a finite collection of $(\mathbb{F}_2^d, \sigma')$-freeness properties, where $\mathbb{F}_2^d$ is the full linear matroid, i.e., the
full linear space $\mathbb{F}_2^d = \{\sum_{i \in I} \mathbf{e}_i : I \subseteq [d], I \neq \emptyset\}$ of some dimension $d$. Namely, suppose that the matroid
$M$ lives in $\mathbb{F}_2^d$ and let us for simplicity assume that it consists of the first $k \leq 2^d - 1$ vectors in the standard
representation. Then it is straightforward to verify that $(M, \sigma)$-freeness is exactly the intersection of all
$(\mathbb{F}_2^d, \sigma\tau)$-freeness properties, where $\tau$ ranges over all patterns in $\{0, 1\}^{2^d - (k+1)}$ and $\sigma\tau$ denotes
concatenation. (This corresponds to the fact that a violation of $(M, \sigma)$-freeness occurs as soon as the first $k$ vectors in $\mathbb{F}_2^d$
are mapped to points in $\mathbb{F}_2^n$ evaluating to the pattern $\sigma \in \{0, 1\}^k$, regardless of what the evaluation pattern
looks like for the rest of the points.) As another example of the expressive power of matroid constraints, note that
low-degree polynomials (with constant term zero) can be specified as the intersection of all $(\mathbb{F}_2^d, \tau)$-freeness
properties, where $d$ is fixed and $\tau$ ranges over all patterns in $\{0, 1\}^{2^d - 1}$ of odd parity.
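The odd-parity characterization of low-degree polynomials can be sanity-checked numerically in a tiny case. The sketch below assumes that "low-degree" means degree less than $d$ (our reading, not stated explicitly above), takes $d = n = 3$, and encodes vectors as bitmasks; the two polynomials and all function names are hypothetical examples. For every linear map $L : \mathbb{F}_2^3 \to \mathbb{F}_2^3$ it computes the parity of the evaluation pattern over the $2^d - 1$ non-zero vectors of the full linear matroid.

```python
from itertools import product

def apply_linear(images, v):
    """Apply the F_2-linear map that sends e_i to images[i]; vectors are bitmasks."""
    out = 0
    for i, img in enumerate(images):
        if (v >> i) & 1:
            out ^= img
    return out

def pattern_parity(p, images, d):
    """Parity of the pattern (p(L(v)))_v over all non-zero v in F_2^d."""
    return sum(p(apply_linear(images, v)) for v in range(1, 2 ** d)) % 2

def bit(x, i):
    return (x >> i) & 1

def quadratic(x):
    """x1*x2 + x3: degree 2 < d = 3, constant term zero."""
    return (bit(x, 0) & bit(x, 1)) ^ bit(x, 2)

def cubic(x):
    """x1*x2*x3: degree 3, not low-degree for d = 3."""
    return bit(x, 0) & bit(x, 1) & bit(x, 2)

d = n = 3
quadratic_never_odd = all(
    pattern_parity(quadratic, images, d) == 0
    for images in product(range(2 ** n), repeat=d)
)
cubic_parity_at_identity = pattern_parity(cubic, (0b001, 0b010, 0b100), d)
```

Here the quadratic never produces an odd-parity pattern, so it is free of all such constraints, whereas the cubic already yields an odd-parity pattern at the identity map.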

It follows from the preceding paragraph that all linear-invariant properties in $\mathbb{F}_2^n \to \{0, 1\}$ that are
one-sided-testable are collections of full linear matroid freeness properties, so matroid constraints $(\mathbb{F}_2^d, \sigma)$ can
be seen to be the building blocks of all one-sided-testable linear-invariant properties. In the other direction,
[BGS10] established that any collection $\mathcal{M}$ of graphic matroid freeness properties is testable by a one-sided
tester. It is again not hard to see that for any graphic matroid constraint $(M(G), \sigma)$, where $G$ is a graph on
$d$ vertices, one can represent $(M(G), \sigma)$-freeness as the intersection of all $(M(K_d), \sigma\tau)$-freeness properties
where $\sigma$ labels the edges of $G$ and $\tau$ ranges over all possible labels on the edges in $K_d$ not present in $G$.

$^4$ Modulo a standard technical assumption that is not relevant to this discussion; we refer to [BGS10] for the precise statement.
Figure 1: The standard matroid representation for $M(K_5)$.

Thus, to gain an understanding of the semantics of matroid freeness properties in general, and of the
testability results in [BGS10] in particular, a necessary first step is to comprehend graphic matroid
constraints $(M(K_d), \sigma)$. Furthermore, even this first step appears to be a challenging problem in its own right.
Therefore, in what follows we will mostly focus on matroids over complete graphs. For the rest of this
paper, we fix the representation of such matroids as follows.

Definition 2.6 (Standard representation of complete graph matroids). We choose the $d - 1$ independent
basis vectors of $M(K_d)$ to be the $d - 1$ edges incident to some (arbitrarily chosen but) fixed vertex. The $\binom{d}{2}$
vectors in $M(K_d)$ will then consist of all the $d - 1$ weight-1 vectors and all the $\binom{d-1}{2}$ weight-2 vectors in
$\mathbb{F}_2^{d-1}$. Moreover, these $\binom{d}{2}$ vectors are always ordered lexicographically as

$M(K_d) = \{\mathbf{e}_1, \mathbf{e}_2, \ldots, \mathbf{e}_{d-1}, \mathbf{e}_1 + \mathbf{e}_2, \mathbf{e}_1 + \mathbf{e}_3, \ldots, \mathbf{e}_1 + \mathbf{e}_{d-1}, \mathbf{e}_2 + \mathbf{e}_3, \mathbf{e}_2 + \mathbf{e}_4, \ldots, \mathbf{e}_{d-2} + \mathbf{e}_{d-1}\}$.
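The ordering in Definition 2.6 is easy to generate mechanically. The following minimal sketch lists the vectors of $M(K_d)$ as integer bitmasks, where bit $i - 1$ stands for $\mathbf{e}_i$; this encoding and the function name are assumptions made for illustration.

```python
from itertools import combinations
from math import comb

def complete_graph_matroid(d):
    """Vectors of M(K_d) in F_2^(d-1) as bitmasks, in the order of Definition 2.6."""
    basis = [1 << i for i in range(d - 1)]  # e_1, ..., e_{d-1}
    # combinations() yields index pairs in lexicographic order:
    # e_1+e_2, e_1+e_3, ..., e_{d-2}+e_{d-1}.
    pairs = [basis[i] ^ basis[j] for i, j in combinations(range(d - 1), 2)]
    return basis + pairs

vectors_k5 = complete_graph_matroid(5)
weights_k5 = [bin(v).count("1") for v in vectors_k5]
```

For $d = 5$ this produces $\binom{5}{2} = 10$ vectors: four of weight 1 followed by $\binom{4}{2} = 6$ of weight 2, matching the counts in the definition.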

See Figure 1 for an illustration of the standard matroid representation of $M(K_5)$. We make the
observation, which will be used later, that for any fixed vector $\mathbf{v} \in M(K_d)$ we can find a standard matroid basis
that contains $\mathbf{v}$ and is of the form in Definition 2.6, i.e., where every non-basis matroid vector is a sum
of two basis vectors. For the weight-1 vectors this is by definition, but it also holds for weight-2 vectors
by symmetry. Namely, if we fix a vector $\mathbf{e}_i + \mathbf{e}_j$ for $i < j$, it is immediate to verify that the set of vectors
$\{\mathbf{e}_i, \mathbf{e}_i + \mathbf{e}_1, \ldots, \mathbf{e}_i + \mathbf{e}_{i-1}, \mathbf{e}_i + \mathbf{e}_{i+1}, \ldots, \mathbf{e}_i + \mathbf{e}_j, \ldots, \mathbf{e}_i + \mathbf{e}_{d-1}\}$ and their pairwise sums generate $M(K_d)$.
Another easy way of seeing this is perhaps to take a look at Figure 1 and make a "proof by picture."
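The observation about alternative bases can also be confirmed by direct computation for small $d$. The sketch below assumes the bitmask encoding in which bit $i - 1$ stands for $\mathbf{e}_i$, and reads the basis in question as $\mathbf{e}_i$ together with all $\mathbf{e}_i + \mathbf{e}_l$ for $l \neq i$; the function names are hypothetical.

```python
from itertools import combinations

def complete_graph_matroid(d):
    """Vectors of M(K_d) in F_2^(d-1) as bitmasks, in the order of Definition 2.6."""
    basis = [1 << t for t in range(d - 1)]
    return basis + [basis[a] ^ basis[b] for a, b in combinations(range(d - 1), 2)]

def star_basis(d, i):
    """The set {e_i} together with {e_i + e_l : l != i}, for a 1-based index i."""
    e = [1 << t for t in range(d - 1)]
    return [e[i - 1]] + [e[i - 1] ^ e[l] for l in range(d - 1) if l != i - 1]

def with_pairwise_sums(basis):
    """The basis vectors together with all sums of two distinct basis vectors."""
    return set(basis) | {a ^ b for a, b in combinations(basis, 2)}

def claim_holds(d):
    """Check that every star basis and its pairwise sums give exactly M(K_d)."""
    target = set(complete_graph_matroid(d))
    return all(with_pairwise_sums(star_basis(d, i)) == target for i in range(1, d))
```

Each such basis contains $\mathbf{e}_i + \mathbf{e}_j$ for every $j \neq i$, so any fixed weight-2 vector is covered by choosing $i$ to be one of its two indices.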

As a final notational convention, we make explicit our use of wildcards in patterns, allowing the use
of $*$ when the meaning is clear from context. For instance, we will write $1^*$ to denote the all-ones pattern,
and $0^d 1^*$ denotes the pattern with ones everywhere except in the first $d$ positions (relative to some fixed
representation of the matroid in question).

3 A Method for Proving Distinctness of Matroid Freeness Properties 

In this section, we develop a method to determine the relations between matroid freeness properties by
way of labeled matroid homomorphisms. In brief, we establish a connection between the existence of
an embedding from $(M, \sigma)$ to $(N, \tau)$ on the one hand and the distance between the property of being
$(M, \sigma)$-free and the property of being $(N, \tau)$-free on the other. Namely, we show that if there exists a
labeled homomorphism from $(M, \sigma)$ to $(N, \tau)$ then $(N, \tau)$-freeness contains $(M, \sigma)$-freeness, and that
otherwise, at least in some specific cases which we characterize, these properties are well separated in the
sense of Definition 2.1. We find this quite surprising, since it is not at all clear a priori how to amplify
non-existence of embeddings into statistical distance between the associated properties. We note that it remains
an intriguing open problem whether this connection holds in general for any pair of binary matroids $(M, \sigma)$
and $(N, \tau)$.

Let us start with the lemma establishing the easy direction of the connection between labeled matroid 
homomorphisms and containment of matroid freeness properties. 

Lemma 3.1. If $M$ and $N$ are any linear matroids such that there is a labeled matroid homomorphism from
$(M, \sigma)$ to $(N, \tau)$, then $(M, \sigma)$-freeness is contained in $(N, \tau)$-freeness.

Before proving the lemma, we state a simple corollary that will be useful later on. 

Corollary 3.2. If $(G, \sigma)$ is a labeled subgraph of $(H, \tau)$ and $f : \mathbb{F}_2^n \to \mathcal{R}$ is an $(M(G), \sigma)$-free function,
then $f$ is also $(M(H), \tau)$-free.

Proof of Corollary 3.2. If $(G, \sigma)$ is a labeled subgraph of $(H, \tau)$, then in particular there is a labeled matroid
homomorphism from $(M(G), \sigma)$ to $(M(H), \tau)$, namely the one induced by the embedding of the vertices
of $G$ in $H$. The claim now follows by Lemma 3.1. □

Proof of Lemma 3.1. Let $\phi : M \to N$ be a labeled matroid homomorphism, where $M = \{v_1, \ldots, v_k\}$ and
$N = \{w_1, \ldots, w_m\}$. Suppose that $f : \mathbb{F}^n \to \mathcal{R}$ is
not $(N, \tau)$-free and in particular that $f$ contains $(N, \tau)$ at a linear map $L : N \to \mathbb{F}^n$. We claim that this
implies that $f$ contains $(M, \sigma)$ at the linear map $L \circ \phi : M \to \mathbb{F}^n$. By assumption, for all $j \in [m]$ we have
$f(L(w_j)) = \tau_j$. Suppose $\phi(v_i) = w_{j_i}$. Then by definition $\sigma_i = \tau_{j_i}$ since $\phi$ preserves labels, and for all
$i \in [k]$ it clearly holds that $f((L \circ \phi)(v_i)) = f(L(\phi(v_i))) = f(L(w_{j_i})) = \tau_{j_i} = \sigma_i$, establishing the
claim. □

Lemma 3.1 provides a method of arguing that some syntactically different properties are in fact identical.
Consider for example $(M(K_3), 1^*)$-freeness and $(M(K_4), 1^*)$-freeness. Since $(K_3, 1^*)$ is a labeled
subgraph of $(K_4, 1^*)$, clearly $(M(K_3), 1^*)$-freeness is contained in $(M(K_4), 1^*)$-freeness. Perhaps
somewhat counter-intuitively, we can also show that there is a labeled matroid homomorphism from
$(M(K_4), 1^*)$ to $(M(K_3), 1^*)$ and hence the containment holds in the other direction as well. To see this,
write $M(K_3)$ in standard representation over $e_1, e_2$ and $M(K_4)$ in standard representation over $f_1, f_2, f_3$.
Define $\phi : M(K_4) \to M(K_3)$ by $\phi(f_1) = e_1$, $\phi(f_2) = e_2$, and $\phi(f_3) = e_1 + e_2$ and extend it to all of
$M(K_4)$ by linearity. We leave it to the reader to verify that $\phi(f_i + f_j) \in M(K_3)$ for all $1 \le i < j \le 3$.
Since $\phi$ is trivially label preserving when all vectors are 1-labeled, it follows that $\phi$ is a labeled matroid
homomorphism. We write this down as a proposition for reference.

Proposition 3.3. The labeled matroid $(M(K_4), 1^*)$ embeds into $(M(K_3), 1^*)$, so $(M(K_3), 1^*)$-freeness
and $(M(K_4), 1^*)$-freeness are the same property.
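
The verification left to the reader above can also be done mechanically. The following is a minimal sketch, assuming vectors over $\mathbb{F}_2$ are encoded as 0/1 tuples and a linear map is given by the images of the unit vectors; the helper names `complete_graph_matroid`, `apply_linear`, and `is_homomorphism` are hypothetical. Labels are ignored, which is harmless here because every vector is 1-labeled.

```python
from itertools import combinations


def complete_graph_matroid(d):
    """M(K_d) in the standard representation of Definition 2.6 (0/1 tuples of length d-1)."""
    k = d - 1
    units = [tuple(int(i == t) for i in range(k)) for t in range(k)]
    pairs = [tuple(int(i in (s, t)) for i in range(k)) for s, t in combinations(range(k), 2)]
    return units + pairs


def apply_linear(images, v):
    """Image of v under the F_2-linear map that sends the i-th unit vector to images[i]."""
    out = [0] * len(images[0])
    for coeff, img in zip(v, images):
        if coeff:
            out = [(a + b) % 2 for a, b in zip(out, img)]
    return tuple(out)


def is_homomorphism(images, source, target):
    """True if every source vector is mapped to some vector of the target matroid."""
    target_set = set(target)
    return all(apply_linear(images, v) in target_set for v in source)


if __name__ == "__main__":
    K3 = complete_graph_matroid(3)
    K4 = complete_graph_matroid(4)
    # phi(f_1) = e_1, phi(f_2) = e_2, phi(f_3) = e_1 + e_2, as in the text
    phi = [(1, 0), (0, 1), (1, 1)]
    print(is_homomorphism(phi, K4, K3))
```

A map that sends two basis vectors to the same image fails the check, since their sum is sent to the zero vector, which is not a matroid vector.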

To provide some more intuition, we give another hopefully instructive example, this time of non-identical
properties. Observe that any function which is $(M(K_3), 011)$-free is also $(M(K_4), 011111)$-free
by Corollary 3.2, since $(K_3, 011)$ is clearly a labeled subgraph of $(K_4, 011111)$. Also, it is not too hard
to show that $(M(K_3), 011)$-freeness and $(M(K_4), 011111)$-freeness are not exactly the same. To see this,
fix any $y \in \mathbb{F}_2^n \setminus \{0\}$ and consider the function $f_y : \mathbb{F}_2^n \to \{0, 1\}$ defined by $f_y(x) = 1$ if $x = y$ and
$f_y(x) = 0$ otherwise. We want to argue that $f_y$ is $(M(K_4), 011111)$-free but not $(M(K_3), 011)$-free. Let
again $M(K_3)$ be represented over unit vectors $e_1, e_2$ and $M(K_4)$ over unit vectors $f_1, f_2, f_3$. The linear
map $L_1 : M(K_3) \to \mathbb{F}_2^n$ sending $e_1$ to $0$ and $e_2$ to $y$ gives a pattern $(011)$ for $M(K_3)$. Now suppose
that $f_y$ would contain $(M(K_4), 011111)$ at some linear map $L_2 : M(K_4) \to \mathbb{F}_2^n$. Then in particular
$f_y(L_2(f_2)) = f_y(L_2(f_3)) = 1$, so we must have $L_2(f_2) = L_2(f_3) = y$. But then $f_y(L_2(f_2 + f_3)) =
f_y(L_2(f_2) + L_2(f_3)) = f_y(0) = 0 \ne 1$. Contradiction. Hence $f_y$ is $(M(K_4), 011111)$-free.
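
For small $n$ the example can be checked exhaustively. The sketch below is illustrative only: it assumes $n = 3$, encodes vectors as 0/1 tuples, and tries every linear map by enumerating all possible images of the basis vectors. The helper names (`contains_pattern` and so on) are hypothetical.

```python
from itertools import combinations, product


def complete_graph_matroid(d):
    """M(K_d) in the standard representation of Definition 2.6 (0/1 tuples of length d-1)."""
    k = d - 1
    units = [tuple(int(i == t) for i in range(k)) for t in range(k)]
    pairs = [tuple(int(i in (s, t)) for i in range(k)) for s, t in combinations(range(k), 2)]
    return units + pairs


def apply_linear(images, v):
    """Image of v under the F_2-linear map that sends the i-th unit vector to images[i]."""
    out = [0] * len(images[0])
    for coeff, img in zip(v, images):
        if coeff:
            out = [(a + b) % 2 for a, b in zip(out, img)]
    return tuple(out)


def contains_pattern(f, n, matroid, pattern):
    """True if f : F_2^n -> {0,1} evaluates to `pattern` on the image of `matroid` under some linear map."""
    k = len(matroid[0])
    points = list(product((0, 1), repeat=n))
    # A linear map is determined by the images of the k unit vectors: (2^n)^k candidates.
    for images in product(points, repeat=k):
        if all(f(apply_linear(images, v)) == label for v, label in zip(matroid, pattern)):
            return True
    return False


def point_indicator(y):
    """The function f_y from the text: 1 at y and 0 everywhere else."""
    return lambda x: 1 if tuple(x) == tuple(y) else 0


if __name__ == "__main__":
    n = 3
    f_y = point_indicator((1, 0, 1))  # any non-zero y works; this choice is arbitrary
    print(contains_pattern(f_y, n, complete_graph_matroid(3), (0, 1, 1)))
    print(contains_pattern(f_y, n, complete_graph_matroid(4), (0, 1, 1, 1, 1, 1)))
```

The search is exponential in $n$ and in the number of basis vectors, so it is only meant as a sanity check on toy instances.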

Note that the above argument shows that $(M(K_3), 011)$-freeness and $(M(K_4), 011111)$-freeness are
distinct, but does not rule out that the two properties are "essentially" the same in that they are very close
in Hamming distance. However, it follows as a corollary of results that we will prove later in this paper
(see Lemma 5.2) that not only does $(M(K_4), 011111)$-freeness contain $(M(K_3), 011)$-freeness, but this
containment is strict in the sense of Definition 2.2.

We next consider the other, harder, direction in the correspondence between labeled matroid homomor- 
phisms and matroid freeness properties. 

3.1 The High-Level Idea 

We want to reduce the problem of separating $(M, \sigma)$-freeness from $(N, \tau)$-freeness to a question about
matroid homomorphisms. Our goal is to prove a statement of the following kind:

Suppose that $(M, \sigma)$ and $(N, \tau)$ are matroid constraints such that there is no labeled matroid
homomorphism from $(M, \sigma)$ to $(N, \tau)$. Then the property of $(M, \sigma)$-freeness is $\delta$-separated
from that of $(N, \tau)$-freeness.

To prove such a statement, we need to exhibit an infinite family of functions $f_n : \mathbb{F}^n \to \mathcal{R}$ for $n \to \infty$ that
is $(M, \sigma)$-free but far from $(N, \tau)$-free. The general outline of the argument is as follows:

1. First, we define a "canonical function" $f_{N,\tau} : \mathbb{F}^n \to \mathcal{R}$ that encodes the structure of the labeled
matroid $(N, \tau)$. More precisely, suppose that $N = \{w_1, \ldots, w_m\} \subseteq \mathbb{F}^d$. Then $f_{N,\tau}$ is constructed
by splitting $x \in \mathbb{F}^n$ into $\{y|z\}$ for $y \in \mathbb{F}^d$ and $z \in \mathbb{F}^{n-d}$ and letting $f_{N,\tau}(x)$ be (in some sense) the
indicator function for whether $y \in \mathbb{F}^d$ is a vector in $N$ and if so what label it has.

2. Then, we prove that $f_{N,\tau}$ is dense in instances of the matroid pattern $(N, \tau)$ and has to be changed in
many positions to become $(N, \tau)$-free.

3. Finally, we assume that $f_{N,\tau}$ contains an instance of the matroid $(M, \sigma)$ as witnessed by the linear
transformation $L : M \to \mathbb{F}^n$. Then we want to argue that composing $L$ with the projection $\pi$ that
maps $x = \{y|z\}$ to $y$, we obtain a labeled matroid homomorphism $\pi \circ L$ from $(M, \sigma)$ to $(N, \tau)$. But
this contradicts the assumption that there is no such homomorphism.

To construct a function that is dense in a pattern is relatively straightforward, and we achieve this
(essentially) by a padding argument. The hard part is the third and final step in the argument. Notice that $\pi \circ L$
is linear by construction, but we are only guaranteed that it maps $M$ to $\mathbb{F}^d$, not into $N$. In general, most
vectors in $\mathbb{F}^d$ are not in $N$, so we somehow have to make sure that we land only in this subset of vectors.
Furthermore, if it holds that $(\pi \circ L)(v_i) = w_j$, then we must make sure that the labels $\sigma_i$ and $\tau_j$ agree. We
remark that in fact, it is not at all clear what it actually means that $f_{N,\tau}$ should be an "indicator function"
for $(N, \tau)$. The function $f_{N,\tau}$ has to map all of $\mathbb{F}^n$ to $\mathcal{R}$, and in general for each value $\tau_j \in \mathcal{R}$ there will be
some vector $w_j \in N$ such that $\tau_j$ is the correct value. However, all the (majority of) vectors $x \in \mathbb{F}^n$ that
do not correspond to vectors in $N$ also have to map somewhere in $\mathcal{R}$, and we need to detect that when such
a vector maps to $\tau_j$, this does not indicate that $x$ is a vector in $N$ labeled by $\tau_j$. This is the tricky part, and
indeed we do not know how to accomplish this for completely general labeled linear matroids $(M, \sigma)$ and
$(N, \tau)$. However, by imposing some structural restrictions on our matroids, we can still derive theorems of
the same type that yield strong results when applied in the right way.

3.2 Canonical Functions for Labeled Matroids

Given a labeled matroid $(N, \tau)$ over vectors $\{w_1, \ldots, w_m\}$ in $\mathbb{F}^d$ and with pattern $\tau = (\tau_1, \ldots, \tau_m) \in \mathcal{R}^m$,
we construct the canonical function for $(N, \tau)$ as follows.

Let $J \subseteq [n]$ be any fixed subset of size $d$. We think of $J$ as the coordinates for the part of $x \in \mathbb{F}^n$
that will correspond to the indicator function for $N$. (For now, the reader can fix this set to be $\{1, \ldots, d\}$
for simplicity.) For $J = \{j_1, \ldots, j_d\}$ and a vector $x \in \mathbb{F}^n$, we will write $x[J]$ to denote the vector $x$
projected to its coordinates in $J$, i.e., $x[J] = \{x_{j_1}, x_{j_2}, \ldots, x_{j_d}\}$. Below, we will write $x = \{y|z\}$ to denote
the decomposition of $x \in \mathbb{F}^n$ into $y = x[J] \in \mathbb{F}^d$ and $z = x \setminus x[J] \in \mathbb{F}^{n-d}$. We let $S$ be any subspace of
$\mathbb{F}^{n-d}$ of high dimension. To be concrete, let us set the dimension to $n - d - 1$, which over $\mathbb{F}_2$ means that $S$ contains
half the points of $\mathbb{F}^{n-d}$. (However, to make it easier to follow the arguments below, the reader can think of
$S$ as being all of $\mathbb{F}^{n-d}$.) We let $b \in \mathcal{R}$ denote a "padding value." Loosely speaking, we will let our canonical
function evaluate to $b$ on points that do not correspond to vectors in $N$. The parameters $J$ and $S$ are not
important for the proofs, so we will suppress them in our definition of the canonical function for $(N, \tau)$, and
the same goes for the dimension $n$.

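Definition 3.4 (Matroid canonical function). Let $N = \{w_1, \ldots, w_m\}$ be any linear matroid in $\mathbb{F}^d$ labeled
by $\tau = (\tau_1, \ldots, \tau_m) \in \mathcal{R}^m$, and let $n$ be a dimension parameter. Fix any $J$ and $S$ as described above and
write $x \in \mathbb{F}^n$ as $x = \{y|z\}$ for $y = x[J] \in \mathbb{F}^d$ and $z = x \setminus x[J] \in \mathbb{F}^{n-d}$. Then for $b \in \mathcal{R}$, the $b$-canonical
function $f^b_{N,\tau} : \mathbb{F}^n \to \mathcal{R}$ of the labeled matroid $(N, \tau)$ is defined by

\[
f^b_{N,\tau}(x) = f^b_{N,\tau}(\{y|z\}) =
\begin{cases}
0 & \text{if } y = 0 \text{ and } z \in S; \\
\tau_j & \text{if } y = w_j \in N \text{ and } z \in S; \\
b & \text{otherwise.}
\end{cases}
\]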



The function $f^b_{N,\tau}$ encodes $(N, \tau)$ in the sense that it is dense in the matroid pattern $(N, \tau)$, as well as
in any $(M, \sigma)$ that maps homomorphically into $(N, \tau)$.
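
To make Definition 3.4 concrete, here is a minimal sketch of the $b$-canonical function over $\mathbb{F}_2$. It assumes that $J$ consists of the first $d$ coordinates, that vectors are 0/1 tuples, and that the subspace $S$ is supplied as a membership predicate; the names `make_canonical_function` and `even_parity` are hypothetical and not taken from the paper.

```python
def make_canonical_function(vectors, labels, d, in_S, b):
    """Build the b-canonical function of the labeled matroid (vectors, labels).

    Assumptions of this sketch: the field is F_2, J = {1, ..., d} (the first d
    coordinates), and S is given by the predicate in_S on the last n-d coordinates.
    """
    label_of = dict(zip(vectors, labels))
    zero = (0,) * d

    def f(x):
        y, z = tuple(x[:d]), tuple(x[d:])
        if not in_S(z):
            return b                  # z outside S: padding value
        if y == zero:
            return 0                  # y = 0 and z in S
        return label_of.get(y, b)     # label of y if y is in N, padding value otherwise

    return f


def even_parity(z):
    """Membership in the codimension-1 subspace of even-weight vectors (one possible choice of S)."""
    return sum(z) % 2 == 0


if __name__ == "__main__":
    # M(K_4) in standard representation with pattern 011111, n = 5, padding value b = 2
    N = [(1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, 0), (1, 0, 1), (0, 1, 1)]
    tau = (0, 1, 1, 1, 1, 1)
    f = make_canonical_function(N, tau, d=3, in_S=even_parity, b=2)
    for x in [(1, 0, 0, 0, 0), (0, 1, 0, 1, 1), (0, 0, 0, 1, 1), (0, 1, 0, 1, 0), (1, 1, 1, 0, 0)]:
        print(x, f(x))
```

The padding value 2 in the usage example is chosen only so that padded points are visibly different from labeled ones; in the theorems below the padding value is 0 or 1.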

Lemma 3.5. If there is a labeled matroid homomorphism from $(M, \sigma)$ to $(N, \tau)$, then the function $f^b_{N,\tau} :
\mathbb{F}^n \to \mathcal{R}$ is $\delta$-far from being $(M, \sigma)$-free, where $\delta > 0$ is some constant independent of $n$ and $b$.

Proof. Let $\phi : (M, \sigma) \to (N, \tau)$ be a homomorphism. Suppose that $N = \{w_1, \ldots, w_m\}$ for $w_j \in \mathbb{F}^d$,
and let $L : \mathbb{F}^d \to \mathbb{F}^n$ be any linear transformation sending $w_j$ to $\{w_j|z_j\}$ for arbitrary $z_j \in S$. Then for all
$v_i \in M$, if $\phi(v_i) = w_j$ it is easy to verify that $f^b_{N,\tau}((L \circ \phi)(v_i)) = f^b_{N,\tau}(L(w_j)) = \tau_j = \sigma_i$, where the
last equality holds since $\phi$ preserves labels. Hence, $f^b_{N,\tau}$ contains $(M, \sigma)$ at $L \circ \phi$.

The proof of $\delta$-farness closely follows a similar argument in [BCSX09]. Set $\delta = 1/(q|\mathbb{F}|^d)$ where
$q = |\mathbb{F}| \ge 2$. Our approach is to show that any function that is $1/(q|\mathbb{F}|^d)$-close to $f^b_{N,\tau}$ contains $(M, \sigma)$
somewhere, which is clearly equivalent to the statement that $f^b_{N,\tau}$ is $\delta$-far from $(M, \sigma)$-free. To this end, we fix a function
$g$ with $\mathrm{dist}(f^b_{N,\tau}, g) = \delta' < 1/(q|\mathbb{F}|^d)$. We will show that $g$ contains $(M, \sigma)$ at some linear map $L'$. Let
$S$ be a subspace of $\mathbb{F}^{n-d}$ of codimension 1 as defined above. Because $|S| = |\mathbb{F}|^{n-d}/q$, clearly we have
$\Pr_{y \in \mathbb{F}^d, z \in S}[f^b_{N,\tau}(\{y|z\}) \ne g(\{y|z\})] \le q\delta'$. For $i \in [m]$, let $\delta_i = \Pr_{z \in S}[f^b_{N,\tau}(\{w_i|z\}) \ne g(\{w_i|z\})]$.
Since

\[ \frac{1}{|\mathbb{F}|^d} \sum_{i=1}^{m} \delta_i \le \Pr_{y \in \mathbb{F}^d, z \in S}\left[f^b_{N,\tau}(\{y|z\}) \ne g(\{y|z\})\right] , \tag{3.1} \]

we therefore have $\sum_{i=1}^{m} \delta_i \le q|\mathbb{F}|^d \cdot \delta' < 1$. Now consider a random linear map $L_1 : \mathbb{F}^d \to S$, and its
extension $L : \mathbb{F}^d \to \mathbb{F}^n$ given by $L(y) = \{y|L_1(y)\}$. For every non-zero $y$ and in particular for $y \in N$, we
have that $L_1(y)$ is distributed uniformly over the subspace $S$. Thus, for any fixed $i \in [m]$, we have

\[ \Pr_{L_1}[g(L(w_i)) \ne \tau_i] = \Pr_{L_1}\left[g(L(w_i)) \ne f^b_{N,\tau}(L(w_i))\right] \le \delta_i . \tag{3.2} \]

By the union bound, we get that

\[ \Pr_{L_1}[\exists\, i \text{ such that } g(L(w_i)) \ne \tau_i] \le \sum_{i=1}^{m} \delta_i < 1 . \tag{3.3} \]

In other words, there exists a linear map $L_1$ (and thus $L$) such that for every $i$, $g(L(w_i)) = \tau_i$ and so $g$
contains $(N, \tau)$ at $L$ and hence $(M, \sigma)$ at the linear map $L' = L \circ \phi$. □



The following observation will come in handy later on. Suppose that, instead of letting $S$ denote a
fixed subspace as above, we associate to each $w \in N \cup \{0\}$ an independently chosen random subset
$S_w \subseteq \mathbb{F}^{n-d}$ of density 1/2. And then, suppose that we modify the construction of the $b$-canonical function
in Definition 3.4 to be:

\[
f^b_{N,\tau}(x) = f^b_{N,\tau}(\{y|z\}) =
\begin{cases}
0 & \text{if } y = 0 \text{ and } z \in S_0; \\
\tau_j & \text{if } y = w_j \in N \text{ and } z \in S_{w_j}; \\
b & \text{otherwise.}
\end{cases}
\tag{3.4}
\]

We claim that Lemma 3.5 holds true with constant probability over the choices of all $S_w$. This is because
we can carry out the analysis of the last paragraph in the above proof with probability over $L_1$ and all $S_w$
(instead of just $L_1$) and then apply a version of the Markov inequality. It is also easy to see that if $\tau_j$ is
distinct from the padding value $b$, then different choices of $S_{w_j}$ in (3.4) yield distinct functions. In this way,
we can obtain a very large family of canonical functions that are $(N, \tau)$-free but far from being $(M, \sigma)$-free.

3.3 Two Dichotomy Theorems

In Section 3.2, we carried out the first two steps in the proof outline in Section 3.1. We now present two
classes of pairs of labeled matroids $(M, \sigma)$ and $(N, \tau)$ for which we can also successfully complete the
crucial third step, and thus establish a dichotomy in the sense that if containment does not hold between the
two properties, then they must be well separated.

Theorem 3.6 (First dichotomy theorem). Let $M$, $N$ be any linear matroids and let $\tau$ be any pattern
for $N$. Then $(M, 1^*)$-freeness is contained in $(N, \tau)$-freeness if and only if there is a labeled matroid
homomorphism from $(M, 1^*)$ to $(N, \tau)$; otherwise $(M, 1^*)$-freeness is $\delta$-separated from $(N, \tau)$-freeness.

Before proving this theorem, we note that as a corollary it immediately yields our first main result in
Theorem 1.3 on page 4 providing a full characterization of monotone matroid freeness properties. This
follows simply by setting $\tau = 1^*$.

Proof of Theorem 3.6. The "if" part of the claim is Lemma 3.1. For the "only if" direction, let us assume
that $(M, 1^*)$-freeness is contained in $(N, \tau)$-freeness. We need to prove that this implies that there is a
labeled matroid homomorphism from $(M, 1^*)$ to $(N, \tau)$. Consider the canonical function $f^0_{N,\tau}$ for $(N, \tau)$
padded with zeros. We know from Lemma 3.5 that $f^0_{N,\tau}$ is not $(N, \tau)$-free. Hence if $(M, 1^*)$-freeness is
contained in $(N, \tau)$-freeness it cannot be $(M, 1^*)$-free either, so suppose it contains $(M, 1^*)$ at the linear
transformation $L : M \to \mathbb{F}^n$. We claim that if we let $\pi$ be the projection that maps $x = \{y|z\}$ to $y$, then
$\pi \circ L$ must be a labeled matroid homomorphism from $(M, 1^*)$ to $(N, \tau)$.

To see this, note first that the map $\pi \circ L$ is clearly linear. We need to check that it sends every vector
$v_i \in M$ to some vector $w_j$ in $N$ and in addition that the labels of the vectors are preserved. But since $M$
has the monotone pattern $(1^*)$ the label is always 1; hence, by assumption we have $f^0_{N,\tau}(L(v_i)) = 1$ for
all $i$. It follows from the way the canonical function was constructed in Definition 3.4 that we must have
$L(v_i) = \{y_i|z_i\}$ where $y_i = w_j$ for some $w_j \in N$ labeled by $\tau_j = 1$, since these are the only vectors for
which $f^0_{N,\tau}$ evaluates to 1. Thus, $\pi \circ L$ is a labeled matroid homomorphism from $(M, 1^*)$ to $(N, \tau)$, which
establishes the claim. □

For future use, we note that the choice of $S$ in the construction of the canonical function $f^0_{N,\tau}$ did not
matter at all in the above proof. This fact will allow us later to exploit the observation made at the end of
Section 3.2.

We can use Theorem 3.6 not only to separate monotone properties from non-monotone ones, but also
to separate two non-monotone properties from each other. Namely, let $(N^1, \tau^1)$ and $(N^2, \tau^2)$ be
non-monotone labeled matroids such that $(N^1, \tau^1)$ is a submatroid of $(N^2, \tau^2)$ (for instance, by being a labeled
subgraph). Suppose furthermore that we can find a monotone matroid $(M, 1^*)$ having the property that
$(M, 1^*) \hookrightarrow (N^2, \tau^2)$ but $(M, 1^*) \not\hookrightarrow (N^1, \tau^1)$. Then it follows from Lemma 3.1 and Theorem 3.6 that
$(N^1, \tau^1)$-freeness must be strictly contained in $(N^2, \tau^2)$-freeness. We will see examples of such results in
Section 5.

However, while this is already considerably stronger than the monotone separation results in [BCSX09], 
it is still not quite satisfactory. The problem is that results obtained in this manner do not show that non- 
monotonicity adds anything essential to matroid freeness properties. For all that we know, it might be 
the case that $(N^1, \tau^1)$-freeness is identical to $(M, 1^*)$-freeness so that the only essential constraint is the
monotone one and the non-monotone constraints are just syntactic sugar. Our second dichotomy theorem, 
while being more restricted in the structural conditions it places on the matroids, is also much more powerful 
in that it directly separates non-monotone properties without going via monotone ones. 

Theorem 3.7 (Second dichotomy theorem). Let $N$ be a matroid in $\mathbb{F}^d$ in standard representation
containing $M(K_{d'+1})$ as a submatroid on the first $d' \le d$ basis vectors (i.e., for $e_1, \ldots, e_{d'}$ all sums $e_i + e_j$,
$1 \le i < j \le d'$ are also vectors in $N$), and let $0^{d'}1^*$ be the pattern for $N$ that gives 0-labels to the vectors
$e_1, \ldots, e_{d'}$ and 1-labels to all other vectors.

Then $(M(K_c), 0^{c-1}1^*)$-freeness is contained in $(N, 0^{d'}1^*)$-freeness if and only if there is a labeled
matroid homomorphism from $(M(K_c), 0^{c-1}1^*)$ to $(N, 0^{d'}1^*)$; otherwise $(M(K_c), 0^{c-1}1^*)$-freeness is
$\delta$-separated from $(N, 0^{d'}1^*)$-freeness.

Theorems 3.6 and 3.7 together constitute the formal version of our second main result in
Theorem 1.4. We remark that in contrast to Theorem 3.6, in Theorem 3.7 we are only considering the case when
the range of the functions in our properties is $\mathcal{R} = \{0, 1\}$.

Proof of Theorem 3.7. The "if" direction is again Lemma 3.1. For the "only if" direction, suppose that
$(M(K_c), 0^{c-1}1^*)$-freeness is contained in $(N, 0^{d'}1^*)$-freeness. Consider $f = f^1_{N, 0^{d'}1^*}$, where we point out
that we are now padding the canonical function with ones (as opposed to the zero-padding in the proof of
Theorem 3.6). We know from Lemma 3.5 that $f$ is far from being $(N, 0^{d'}1^*)$-free. Hence, by our assumption
it cannot be $(M(K_c), 0^{c-1}1^*)$-free either. Suppose that $f$ contains $(M(K_c), 0^{c-1}1^*)$ at $L : M(K_c) \to \mathbb{F}^n$.
Let $\pi$ be the projection that maps $x = \{y|z\}$ to $y$. We want to argue that $\pi \circ L$ must be a labeled matroid
homomorphism from $(M(K_c), 0^{c-1}1^*)$ to $(N, 0^{d'}1^*)$.

Let us first focus on the basis vectors in $M(K_c)$, which we will denote $f_1, \ldots, f_{c-1}$, and which are all
0-labeled. Since $f$ applied on the image of $M(K_c)$ under $L$ evaluates to the pattern $(0^{c-1}1^*)$, we have
$f(L(f_i)) = 0$ for all $i \in [c-1]$. Looking at the definition of $f$, this means that $L(f_i) = \{y_i|z_i\}$ where
either $y_i = e_l$ for some $e_l \in N$, $l \le d'$, or else $y_i = 0$. We also note for the record, since we will need it
later in the proof, that we must have $z_i \in S$ in both of these cases.

Clearly, if $L(f_i) = \{y_i|z_i\}$ for $y_i = 0$, the linear map $\pi \circ L$ is no matroid homomorphism (since $0 \notin N$)
and the construction breaks down. We claim, however, that this can never happen. Given this claim, all basis
vectors $f_i \in M(K_c)$ must then be mapped by $L$ to $\{e_{l_i}|z_i\}$ for some $e_{l_i} \in N$, $l_i \le d'$ and some $z_i \in S$. The
only other vectors in $M(K_c)$ are sums $f_i + f_j$, and by linearity we have $(\pi \circ L)(f_i + f_j) = e_{l_i} + e_{l_j} \in \mathbb{F}^d$
for $l_i, l_j \le d'$. Again we have two cases. If $e_{l_i} \ne e_{l_j}$, then by the assumptions in the statement of the
theorem we have that $e_{l_i} + e_{l_j}$ is a vector in $N$ labeled by 1 as desired. If, however, $e_{l_i} = e_{l_j}$, then $f_i + f_j$
gets mapped to $0$ and the construction breaks down. This cannot happen, however, since it would imply that
$f(L(f_i + f_j)) = f(\{0|z_i + z_j\}) = 0$. (Again for the record, this holds because $z_i, z_j \in S$ implies that also
$z_i + z_j \in S$, since $S$ is a linear subspace.) But $f(L(f_i + f_j)) = 0 \ne 1$ contradicts the assumption that $f$
evaluates to the pattern $(0^{c-1}1^*)$ on the image of $M(K_c)$ under $L$. Hence, $\pi \circ L$ maps $M(K_c)$ into $N$ while
preserving labels, i.e., it is a labeled matroid homomorphism.

It remains to prove the claim that $L(f_i) \ne \{0|z_i\}$ for all basis vectors $f_i \in M(K_c)$. Suppose on the
contrary that there is a vector $f_i$ such that $L(f_i) = \{0|z_i\}$. Fix some other basis vector $f_j$, $j \ne i$, in $M(K_c)$
that is mapped by $L$ to $\{y_j|z_j\}$ for $y_j \in \{e_1, \ldots, e_{d'}\} \cup \{0\}$, and consider $L(f_i + f_j) = L(f_i) + L(f_j) =
\{y_j|z_i + z_j\}$. By assumption $f(L(f_i)) = f(L(f_j)) = 0$, which by the definition of $f$ implies that $z_i, z_j \in S$.
This in turn means that $f(L(f_i + f_j)) = f(L(f_j)) = 0$. But this is again a contradiction to the assumption
that $f$ evaluates to the pattern $(0^{c-1}1^*)$, which requires that $f(L(f_i + f_j)) = 1$. The claim follows, and the
proof of the theorem is complete. □

We remark that unlike the proof of Theorem 3.6, here we crucially use the fact that $S$ is a linear
subspace and so we cannot replace $S$ by, for instance, a random subset.



4 Some Labeled Graphic Matroid Non-Homomorphisms 

In order for the method developed in the previous section to be useful, we need to find (families of) labeled
matroids that do not embed homomorphically into each other. In this section, we establish such matroid
non-homomorphism results for graphic matroids. Recall that for labeled graphic matroids $(M(G), \sigma)$ and
$(M(H), \tau)$, which we will from now on identify with their underlying labeled graphs $(G, \sigma)$ and $(H, \tau)$ for ease
of notation, the matroid vectors correspond to edges in the graphs, and a labeled matroid homomorphism is
a mapping of edges to edges that preserves labels and cycles.

The key to all of our non-homomorphism results is (the proof of) the following lemma. 

Lemma 4.1. For all $d \ge 5$, there is no labeled matroid homomorphism from $(K_d, \sigma)$ to $(K_{d-1}, \tau)$ for any
patterns $\sigma$ and $\tau$.

We remark that as shown in Proposition 3.3, there is in fact a homomorphism from $(K_4, 1^*)$ to $(K_3, 1^*)$.
Thus, the condition $d \ge 5$ above is necessary.

To prove Lemma 4.1, we ignore the patterns $\sigma$ and $\tau$ and instead argue directly that regardless of what
these patterns look like, there can exist no edge homomorphism from $K_d$ to $K_{d-1}$ that preserves cycle
structure. This argument rests on two simple but very useful claims.

To state these claims, we recall from Definition 2.6 that the standard representation of $K_d$ is to fix some
vertex $v$ and pick as a basis the vectors corresponding to all edges $e$ incident to this vertex. Even once
we have fixed such a vertex and associated its edges with unit vectors $e_1, e_2, \ldots, e_{d-1}$, we can get another
essentially equivalent basis by fixing any other vertex and looking at the vectors corresponding to edges
incident to that vertex instead, as explained after Definition 2.6. We will refer to any such basis, which in
this section we identify with the corresponding set of edges, as a standard form basis.

Claim 4.2. For any $c, d \ge 3$, if $(K_d, \sigma) \hookrightarrow (K_c, \tau)$, then it holds that any two incident edges in $K_d$ must
map to distinct edges in $K_c$.

Claim 4.3. For any $d \ge 5$ and $c \ge 4$, if $(K_d, \sigma) \hookrightarrow (K_c, \tau)$, then all edges in any standard form basis of
$K_d$ must map to edges in $K_c$ that are all incident to one common vertex.

Given these two claims, Lemma 4.1 follows immediately by a pigeonhole argument: since a standard
form basis in $K_d$ has $d-1$ edges while each vertex in $K_{d-1}$ has only $d-2$ incident edges, the conclusions of
Claims 4.2 and 4.3 cannot possibly both hold simultaneously. This proves the lemma by contradiction.
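
For the smallest case $d = 5$, Lemma 4.1 can also be confirmed by exhaustive search, since there are only finitely many linear maps to try. The sketch below is an illustration under our own assumptions (0/1 tuple encoding over $\mathbb{F}_2$, hypothetical helper names); it ignores labels, which only makes a homomorphism easier to find, so a negative answer rules out labeled homomorphisms as well.

```python
from itertools import combinations, product


def complete_graph_matroid(d):
    """M(K_d) in the standard representation of Definition 2.6 (0/1 tuples of length d-1)."""
    k = d - 1
    units = [tuple(int(i == t) for i in range(k)) for t in range(k)]
    pairs = [tuple(int(i in (s, t)) for i in range(k)) for s, t in combinations(range(k), 2)]
    return units + pairs


def apply_linear(images, v):
    """Image of v under the F_2-linear map that sends the i-th unit vector to images[i]."""
    out = [0] * len(images[0])
    for coeff, img in zip(v, images):
        if coeff:
            out = [(a + b) % 2 for a, b in zip(out, img)]
    return tuple(out)


def find_homomorphism(source, target):
    """Return basis images of some (unlabeled) matroid homomorphism, or None if there is none."""
    k = len(source[0])
    target_set = set(target)
    # The unit vectors are themselves matroid vectors, so their images must lie in the target.
    for images in product(target, repeat=k):
        if all(apply_linear(images, v) in target_set for v in source):
            return images
    return None


if __name__ == "__main__":
    K3, K4, K5 = (complete_graph_matroid(d) for d in (3, 4, 5))
    print("K_4 -> K_3:", find_homomorphism(K4, K3))
    print("K_5 -> K_4:", find_homomorphism(K5, K4))
```

The search space has size $\binom{d-1}{2}^{d-1}$ for a map from $K_d$ to $K_{d-1}$, so this approach is feasible only for very small $d$ and is no substitute for the proof that follows.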

It remains to establish the claims, and we do so next. Note again that we write v to denote a vertex and
e to denote an edge, whereas vectors are denoted e, f , v, w, et cetera. In what follows below, we will go
freely back and forth between edge and vector representation of the graphic matroids.

The first claim is more or less immediate. 

Proof of Claim 4.2. Let $e_1 = (v_i, v_j)$ and $e_2 = (v_i, v_k)$ be two incident edges in $K_d$ and suppose that they
map to the same edge in $K_c$. Now $e_1$ and $e_2$ form a cycle in $K_d$ together with $e_3 = (v_j, v_k)$, but since $K_c$
does not have self-loops there is no way to map $e_3$ to an edge in $K_c$ so that this cycle is preserved. (Or,
reasoning in terms of matroid vectors in a linear space over $\mathbb{F}_2$, since the vectors for $e_1$ and $e_2$ map to the
same vector in $K_c$ and hence cancel, there is no way to map the third vector to a non-zero vector in $K_c$ so
that the sum of the images of all three vectors cancels.) □

The second claim is not much harder, but requires a little more work. 

Proof of Claim 4.3. Suppose $K_d$ embeds into $K_c$ by a linear map $\phi$ and let $\{f_1, \ldots, f_{d-1}\}$ be the basis
vectors of $K_d$ and $\{e_1, \ldots, e_{c-1}\}$ be the basis vectors of $K_c$. We show by induction on $k$, $1 \le k \le d-1$,
that $\{f_1, \ldots, f_k\}$ must map to (distinct) edges incident to some common vertex $v$ in $K_c$.

Without loss of generality, assume $\phi(f_1) = e_1$ (if $f_1$ would map to any other vector we just make a
basis change in $K_c$ as explained after Definition 2.6). By Claim 4.2 we have $\phi(f_2) \ne e_1$. Also note that
$\phi(f_2) \ne e_i + e_j$ with $1 < i < j$, because this would imply $\phi(f_1 + f_2) = e_1 + e_i + e_j$, which is a weight-3
vector that is not a member of $K_c$. (In terms of edges, this would correspond to incident edges $e_1 = (v_i, v_j)$
and $e_2 = (v_i, v_k)$ in $K_d$ mapping to non-incident edges in $K_c$, but if so there would be no way the map
could preserve the cycle of $e_1$ and $e_2$ with $e_3 = (v_j, v_k)$.)

Therefore we are left with two cases: $\phi(f_2) = e_i$ or $\phi(f_2) = e_1 + e_i$ for some $i > 1$, and because of
symmetry we may choose $i = 2$ without loss of generality. Let us analyze these two cases.^5

Case 1 ($\phi(f_2) = e_2$): Consider where $\phi$ can send $f_3$. Clearly $\phi(f_3) \notin \{e_1, e_2\} = \{\phi(f_1), \phi(f_2)\}$ by
the distinctness in Claim 4.2. Also, we claim that $\phi(f_3) \ne e_1 + e_2$. To see this, observe that if
$\{\phi(f_1), \phi(f_2), \phi(f_3)\} = \{e_1, e_2, e_1 + e_2\}$, then $\phi$ would have to map $f_4$ to some vector outside of
this set by distinctness, but if so it in turn follows that $\phi(f_i + f_4) = \phi(f_i) + \phi(f_4)$ would be a
vector of weight at least 3 for some $i \in \{1, 2, 3\}$. (Notice that here we crucially use $d \ge 5$.) To conclude, we argue that
in fact $\phi(f_3) \ne e_i + e_j$ for any $i < j$ with $j > 2$. For if $i = 1$, say, it follows just as above that
$\phi(f_2 + f_3) = \phi(f_2) + \phi(f_3)$ would be a weight-3 vector. Hence we must have $\phi(f_3) = e_i$ for some $i$,
which we may without loss of generality set to $i = 3$.

Case 2 ($\phi(f_2) = e_1 + e_2$): In this case we have $\phi(f_3) \ne e_1$ by distinctness, and for the same reasons as
in case 1 we deduce that $\phi(f_3) \ne e_2$. Furthermore, $\phi(f_3) \ne e_i$ for $i > 2$, since if so $\phi(f_2 + f_3)$
would be a weight-3 vector. Hence, $\phi(f_3) = e_i + e_j$ for some $i < j$. But then we must have $i = 1$,
since otherwise $\phi(f_1 + f_3)$ would have weight 3. We have proven that in this case as well, the vectors
$\phi(f_1) = e_1$, $\phi(f_2) = e_1 + e_2$, and $\phi(f_3) = e_1 + e_j$ must correspond to edges incident to a common
vertex.

Now we proceed to the inductive step. Suppose $\phi$ maps the vectors $\{f_1, \ldots, f_k\}$ to $k$ edges incident to
a single vertex in $K_c$ for some $k \ge 3$. Without loss of generality, we assume that the vectors are mapped
to $e_1, \ldots, e_k$ in $K_c$ (because again, since we know that the edges are incident we are free to make a basis
change in $K_c$ so that we get the standard basis in terms of unit vectors). Consider the image of edge $f_{k+1}$
in $K_c$. Clearly, $\phi(f_{k+1}) \ne e_i$ for any $i \in [k]$ by distinctness. It is also easy to see that $\phi(f_{k+1})$ cannot be
any weight-2 vector. For if we would have $\phi(f_{k+1}) = e_i + e_j$, then since $k \ge 3$ there would exist some
$l \in [k]$ with $l \notin \{i, j\}$ such that $\phi(f_l + f_{k+1}) = e_l + e_i + e_j \notin K_c$, which is a contradiction. Therefore,
$\phi(f_{k+1}) = e_i$ for some $i > k$. This completes the induction step and thus finishes the proof of the claim. □

^5 In fact, the attentive reader might have noted here that without loss of generality we can restrict ourselves to only one case and
fix $\phi(f_2) = e_2$. This is so since in the other case we can again make a basis change in $K_c$ as described after Definition 2.6 to get
a new standard basis containing $e_1$ and $e_1 + e_2$. However, we believe that a formal case analysis as given in this proof, although
strictly speaking unnecessary, is easier to follow.

Next, we study labeled homomorphisms from $K_d$ into itself, and rule out that a monotone pattern can
be mapped homomorphically into a non-monotone one.

Lemma 4.4. For all $d \ge 5$ and $c \ge 1$, there is no labeled matroid homomorphism from $(K_d, 1^*)$ to
$(K_d, 0^c 1^*)$.

Proof. By Claims 4.2 and 4.3 it must be the case that all the basis vectors of $K_d$ map in a one-to-one
and onto fashion to distinct edges adjacent to a single vertex $v_1$ in $K_d$, say $\phi(e_1) = (v_1, v_2)$, $\phi(e_2) =
(v_1, v_3)$, ..., $\phi(e_{d-1}) = (v_1, v_d)$. Since labels are preserved by the mapping, none of the basis vectors $e_i$
maps to a 0-labeled edge in $(K_d, 0^c 1^*)$. However, consider any 0-labeled edge $e = (v_i, v_j)$ in $(K_d, 0^c 1^*)$;
by the previous sentence this edge is not incident to $v_1$, so $i, j \ge 2$.
Notice that since $\phi$ is a homomorphism we must have $\phi(e_{i-1} + e_{j-1}) = \phi(e_{i-1}) + \phi(e_{j-1}) = (v_i, v_j)$, that is, $e_{i-1} + e_{j-1}$
in $(K_d, 1^*)$ maps to this 0-labeled edge. But if so $\phi$ is not label-preserving, since all vectors in $(K_d, 1^*)$ are
labeled 1. Contradiction. □

A second useful lemma about non-monotone patterns is as follows. 

Lemma 4.5. For all $d \ge 4$, there is no labeled matroid homomorphism from $(K_d, 0^{d-1}1^*)$ to $(K_d, 0^{d-2}1^*)$.

Proof. By Claims 4.2 and 4.3 it must be that all the $d-1$ basis vectors of $(K_d, 0^{d-1}1^*)$ map in a one-to-one
fashion to distinct edges adjacent to a single vertex in $(K_d, 0^{d-2}1^*)$. Notice that the basis vectors of
$(K_d, 0^{d-1}1^*)$ are all labeled by 0 and hence they have to map to the 0-labeled edges in $(K_d, 0^{d-2}1^*)$. But
there are only $d-2$ such edges in $(K_d, 0^{d-2}1^*)$, and since the basis vectors must be mapped to distinct
edges it immediately follows that such a homomorphism is not possible to construct. □
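
Small instances of Lemmas 4.4 and 4.5 can be checked by brute force in the same spirit. The sketch below is illustrative only: it assumes the 0/1 tuple encoding over $\mathbb{F}_2$, places the 0-labels on the first vectors of the standard ordering, and uses hypothetical helper names such as `find_labeled_homomorphism`.

```python
from itertools import combinations, product


def complete_graph_matroid(d):
    """M(K_d) in the standard representation of Definition 2.6 (0/1 tuples of length d-1)."""
    k = d - 1
    units = [tuple(int(i == t) for i in range(k)) for t in range(k)]
    pairs = [tuple(int(i in (s, t)) for i in range(k)) for s, t in combinations(range(k), 2)]
    return units + pairs


def apply_linear(images, v):
    """Image of v under the F_2-linear map that sends the i-th unit vector to images[i]."""
    out = [0] * len(images[0])
    for coeff, img in zip(v, images):
        if coeff:
            out = [(a + b) % 2 for a, b in zip(out, img)]
    return tuple(out)


def zeros_then_ones(c, m):
    """The pattern 0^c 1^* of total length m."""
    return (0,) * c + (1,) * (m - c)


def find_labeled_homomorphism(source, sigma, target, tau):
    """Return basis images of a label-preserving matroid homomorphism, or None if there is none."""
    label_of = dict(zip(target, tau))
    k = len(source[0])
    for images in product(target, repeat=k):
        # label_of.get(...) is None for vectors outside the target, which never equals a label
        if all(label_of.get(apply_linear(images, v)) == s for v, s in zip(source, sigma)):
            return images
    return None


if __name__ == "__main__":
    K4, K5 = complete_graph_matroid(4), complete_graph_matroid(5)
    for c in range(1, 5):  # Lemma 4.4 with d = 5
        print("c =", c, find_labeled_homomorphism(K5, zeros_then_ones(0, 10), K5, zeros_then_ones(c, 10)))
    # Lemma 4.5 with d = 4
    print(find_labeled_homomorphism(K4, zeros_then_ones(3, 6), K4, zeros_then_ones(2, 6)))
```

Running the same search for $(K_4, 1^*)$ and $(K_4, 0^1 1^*)$ does find a homomorphism (the three basis vectors can be sent to the all-ones triangle avoiding the 0-labeled edge), which is consistent with the requirement $d \ge 5$ in Lemma 4.4.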



5 Infinite Hierarchies of Well Separated Matroid Freeness Properties 

We have finally reached the point where we can put all the material in Sections 3 and 4 together and prove 
the existence of infinite hierarchies of $(M, \sigma)$-freeness properties as claimed in the introduction. We remark
that the techniques we have developed could be used to yield many different such hierarchies, but for brevity 
and concreteness we will focus below on one particular result that illustrates this general point. Namely, the 
lemmas proven in this section all lead up to Theorem 5.4, which is the formal statement of our third main 
result claimed in Theorem 1.5 in the introduction. 

Let us start by proving that monotone $M(K_d)$-freeness properties form a strict hierarchy. Note that we
will continue the mild abuse of notation introduced in Section 4 by identifying a graphic matroid $(M(G), \sigma)$
and its underlying labeled graph $(G, \sigma)$.

Lemma 5.1. For $d \ge 4$, the $(K_d, 1^*)$-freeness properties form an infinite hierarchy of strictly contained
properties.

Proof. Since $(K_d, 1^*)$ is a labeled subgraph of $(K_{d+1}, 1^*)$ we have that $(K_d, 1^*)$-freeness is contained in
$(K_{d+1}, 1^*)$-freeness by Corollary 3.2. Since there is no homomorphism from $(K_{d+1}, 1^*)$ to $(K_d, 1^*)$ for
$d \ge 4$ according to Lemma 4.1, we conclude from Theorem 3.6 that the containment must be strict in a
property testing sense. □

We remark that this lemma improves on a similar theorem of [BCSX09], which could only show
separation between the graphic matroids of $K_d$ and $K_{d+2}$. But we can strengthen Lemma 5.1 even further as
follows.
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Lemma 5.2. For d > 4 and any sequence {cd}^L 4 with 1 < c d < d, the (K d ,0 Cd l*)-freeness properties 
form an infinite sequence of properties such that (K d +\, Cd+1 l*)-freeness is 5-separated from (Kd, Cd l*)- 
freeness. If in addition the c^-sequence is monotone increasing, i.e., Cd+i > c d , then we get an infinite 
hierarchy of strictly contained properties. 

Proof. $(K_d, 1^*)$ does not embed in $(K_d, 0^{c_d}1^*)$ by Lemma 4.4. However, $(K_d, 1^*)$ is a labeled subgraph
of $(K_{d+1}, 0^{c_{d+1}}1^*)$, since if we throw away the unique vertex in $(K_{d+1}, 0^{c_{d+1}}1^*)$ incident to all 0-labeled
edges what we have left is exactly $(K_d, 1^*)$. It follows that the function $f_{(K_d, 0^{c_d}1^*)}$, which is far from being
$(K_d, 0^{c_d}1^*)$-free, is $(K_d, 1^*)$-free by Theorem 3.6 and hence $(K_{d+1}, 0^{c_{d+1}}1^*)$-free by Corollary 3.2.

If it furthermore holds that $c_d \le c_{d+1} \le d$, then $(K_d, 0^{c_d}1^*)$ is a labeled subgraph of $(K_{d+1}, 0^{c_{d+1}}1^*)$,
which gives containment of the corresponding matroid freeness properties (again by Corollary 3.2). □

Observe that Lemma 5.2 is indeed a strengthening of Lemma 5.1. This is so since the functions
$f_{(K_d, 0^{c_d}1^*)}$, which are $(K_d, 1^*)$-free but far from $(K_{d-1}, 1^*)$-free, witness that $(K_d, 1^*)$-freeness is $\delta$-separated
from $(K_{d-1}, 1^*)$-freeness (and containment in the other direction is obvious since $(K_{d-1}, 1^*)$ is a subgraph
of $(K_d, 1^*)$).

Notice, however, that as discussed after the proof of Theorem 3.6, the way we establish Lemma 5.2 
does not ensure that non-monotone matroid freeness properties are nontrivial. We might worry that perhaps 
all the non-monotone properties coincide with the intermediate monotone properties of $(K_d, 1^*)$-freeness
used to obtain the separation. The next lemma provides some assurance by conclusively ruling out this
possibility.

Lemma 5.3. For $d \ge 4$, it holds that $(K_{d+1}, 0^d 1^*)$-freeness is $\delta$-separated from the union of $(K_d, 0^{d-1}1^*)$-freeness
and $(K_d, 1^*)$-freeness.

Proof. Note first that this lemma is conceptually different from the preceding ones, since here we need
to find a function that is $(K_{d+1}, 0^d 1^*)$-free but is simultaneously far from being $(K_d, 0^{d-1}1^*)$-free and
$(K_d, 1^*)$-free. On the face of it, such cases are not covered by the techniques in Sections 3 and 4, which only
relate pairs of labeled matroids. However, there is a way to get around this obstacle by finding an "intermediate"
labeled matroid such that $(K_d, 0^{d-1}1^*)$ and $(K_d, 1^*)$ both embed into this matroid but $(K_{d+1}, 0^d 1^*)$
does not. We pick this intermediate matroid to be $(K_{d+1}, 0^{d-1}1^*)$, that is, the graphic matroid over the
complete graph on $d + 1$ vertices that has all edges but one in the standard basis labeled by 0 and has
1-labels everywhere else.

It is easy to check that $(K_d, 0^{d-1}1^*)$ and $(K_d, 1^*)$ are both labeled subgraphs of $(K_{d+1}, 0^{d-1}1^*)$. Consequently,
we can appeal to Lemma 3.5 to conclude that the canonical function $f_{(K_{d+1}, 0^{d-1}1^*)}$ is dense in violations
of both $(K_d, 0^{d-1}1^*)$-freeness and $(K_d, 1^*)$-freeness. However, Lemma 4.5 shows that $(K_{d+1}, 0^d 1^*)$
does not embed homomorphically into $(K_{d+1}, 0^{d-1}1^*)$, and therefore $f_{(K_{d+1}, 0^{d-1}1^*)}$ must be $(K_{d+1}, 0^d 1^*)$-free
according to Theorem 3.7. The lemma follows. □

Combining all of these lemmas, we can prove that non-monotone graphic matroid freeness properties 
provide infinite hierarchies of strictly contained properties. The reader might be helped in parsing the next 
theorem and its proof by looking at the illustration in Figure 2. 

Theorem 5.4. Let $A_d$ denote the set of all $(K_d, 1^*)$-free functions and $B_d$ the set of all $(K_d, 0^{d-1}1^*)$-free
functions in $\bigcup_{n \in \mathbb{N}^+} \{\mathbb{F}_2^n \to \{0, 1\}\}$. Then the following holds:

1. For $d \ge 4$, $A_d$ forms an infinite hierarchy of strictly contained properties.

2. For $d \ge 4$, $B_d$ forms an infinite hierarchy of strictly contained properties.

3. $\bigcap_{d=4}^{\infty} (A_d \cap B_d) \neq \emptyset$.

4. $\left(\bigcup_{d=4}^{\infty} B_d\right) \setminus \left(\bigcup_{d=4}^{\infty} A_d\right) \neq \emptyset$, and in fact the former union of properties is $\delta$-separated from the latter.

5. For $d \ge 4$, $A_d \cup B_d$ is strictly contained in $B_{d+1}$.

6. For $d \ge 4$, $A_d$ and $B_d$ are mutually well separated from one another.

7. For $d \ge 5$, all the properties $A_d$, $B_d$, $A_d \setminus A_{d-1}$, and $B_d \setminus B_{d-1}$ are far from being low-degree
polynomials.

Figure 2: Illustration of hierarchies of properties $A_d$ (dashed) and $B_d$ (dotted) and separating functions.

Proof. The proofs of Claims 1 and 2 were given in Lemma 5.1 and Lemma 5.2, respectively, and as was
discussed after the proof of Lemma 5.2 we can in fact see that both of these hierarchies are witnessed by
the functions $g_d = f_{(K_d, 0^{c_d}1^*)}$ as plotted schematically in Figure 2. To show claim 3, we observe that the
constant function $0 : \mathbb{F}_2^n \to \{0, 1\}$ sending all points to 0 belongs to all of the properties $A_d$ and $B_d$.
Claim 4 similarly follows since the constant function $1 : \mathbb{F}_2^n \to \{0, 1\}$ sending all points to 1 must be
$(K_d, 0^{d-1}1^*)$-free simply by virtue of not having any zeros, while it is far from $(K_d, 1^*)$-free for exactly
the same reason. Claim 5 was established in Lemma 5.3, using the functions $h_d = f_{(K_d, 0^{d-2}1^*)}$, and taken
together, the functions $g_d$ and $h_d$ can be seen to witness the mutual separations in claim 6.
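The observations behind claims 3 and 4 can be checked by brute force on a tiny instance. The Python sketch below assumes the notion of freeness in which $f$ is $(M, \sigma)$-free if no linear map $L$ satisfies $f(L(v_i)) = \sigma_i$ for all $i$, and it assumes the standard representation of $M(K_d)$ with unit vectors for the basis edges followed by all pairwise sums. The names `star_basis_matroid` and `is_free` are illustrative and do not come from the paper.

```python
from itertools import product

def star_basis_matroid(d):
    """Assumed standard representation of M(K_d) over F_2:
    d-1 unit vectors (edges at one vertex), then all sums e_i + e_j."""
    k = d - 1
    units = [tuple(1 if t == i else 0 for t in range(k)) for i in range(k)]
    sums = [tuple((a + b) % 2 for a, b in zip(units[i], units[j]))
            for i in range(k) for j in range(i + 1, k)]
    return units + sums

def is_free(f, n, vectors, labels):
    """True if no linear map L: F_2^k -> F_2^n has f(L(v_i)) == labels[i] for all i
    (assumed definition of (M, sigma)-freeness)."""
    k = len(vectors[0])
    points = list(product((0, 1), repeat=n))
    # A linear map is determined by the images of the k unit vectors.
    for images in product(points, repeat=k):
        def apply(v):
            out = [0] * n
            for coeff, img in zip(v, images):
                if coeff:
                    out = [(a + b) % 2 for a, b in zip(out, img)]
            return tuple(out)
        if all(f(apply(v)) == s for v, s in zip(vectors, labels)):
            return False  # found a violation
    return True

d, n = 4, 2
vecs = star_basis_matroid(d)
all_ones = [1] * len(vecs)                                      # pattern 1^*
zeros_then_ones = [0] * (d - 1) + [1] * (len(vecs) - (d - 1))   # pattern 0^{d-1} 1^*

const0 = lambda x: 0
const1 = lambda x: 1

print(is_free(const0, n, vecs, all_ones), is_free(const0, n, vecs, zeros_then_ones))
print(is_free(const1, n, vecs, zeros_then_ones), is_free(const1, n, vecs, all_ones))
```

The constant-0 function avoids both patterns because each pattern contains a 1-label, while the constant-1 function avoids only the pattern that contains a 0-label. The check is exponential in $n$ and $d$ and is meant only for very small parameters.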

Consider finally claim 7. Recall that what we want to say is that the properties $A_d$ and $B_d$ are "new"
properties not known to be testable before. Note that by necessity, such a statement must be somewhat 
informal — unless we can provide a full enumeration of all testable properties and separate our new proper- 
ties from all of them via some kind of diagonalization argument, which arguably seems neither feasible nor 
particularly reasonable. But what seems natural to do is to prove formally that matroid freeness properties 
are not identical to the "usual suspects", which in this case would seem to be low-degree polynomials. 

We remark that one can first make the easy observation that it cannot possibly be the case that all of
the properties $A_d$ and $B_d$ are low-degree polynomials. If they were, there would be no way they could nest
and intersect in the way shown in Figure 2, since low-degree polynomials just form one strict hierarchy
of concentric circles with respect to degree. We want to prove something stronger, however, namely that
none of the properties $A_d$ and $B_d$ can be just low-degree polynomials. The way we do this is to observe
that we can modify the construction of canonical functions in Definition 3.4 slightly as in Equation (3.4) to
get a huge family of canonical functions instead of just one, and that this can be done in such a way that
Lemma 3.5 and Theorem 3.6 still hold (we refer to the discussion in Section 3 for the details). What this 
means is that we can think of every witnessing function in Figure 2 as being a large, dense cloud of such 
functions, and these functions are simply far too many to all be low-degree polynomials, or even to be close 
to low-degree polynomials. This establishes the claim, and the theorem follows. □ 

The attentive reader might have noticed that there is one natural piece missing in Theorem 5.4, namely
the claim that $B_{d+1} \setminus (A_d \cup B_d)$ is also far from being just low-degree polynomials. This seems very likely
to be the case but we are currently unable to prove this. It is easy to prove that there are polynomials of very
high degree in $B_{d+1} \setminus (A_d \cup B_d)$. The way to see this is to define $h_d = f_{(K_d, 0^{d-2}1^*)}$ as in Definition 3.4
except that we choose $S$ to be a very small subspace, say of constant dimension. Then $h_d$ will evaluate
to 1 everywhere except at a constant number of points, and therefore it cannot possibly be a low-degree
polynomial. However, all such $h_d$ are also close to the constant function evaluating to 1 everywhere, which
has very low degree indeed. One natural idea is instead to pick the set $S$ randomly to get a large number
of functions $h_d$, and then argue that they are so many that there must be examples of such functions that are
far from being low-degree polynomials. Unfortunately, this does not work. The proof of Theorem 3.7 turns
out to be surprisingly delicate, and provably requires $S$ to be a subspace. We still strongly believe that the
properties $B_{d+1} \setminus (A_d \cup B_d)$ are far from low-degree polynomials for all $d \ge 4$, but it seems new techniques
would be required to establish such a claim.

6 Concluding Remarks 

Motivated by questions raised in [BCSX09] and the recent testability results in [BGS10], in this paper we 
have studied the semantics of matroid freeness properties, and in particular the problem of determining when 
two syntactically different matroid constraints in fact also encode semantically different properties. We 
have developed a new method for comparing matroid freeness constraints based on the concept of labeled 
matroid homomorphisms, and have shown that for a surprisingly broad class of matroid freeness properties
this method exactly characterizes the relation between two matroid freeness properties. Even more, when 
the method works, it in fact establishes a strong dichotomy in the sense that either one property must be 
contained in the other or the properties are strictly distinct in a property testing sense. As a consequence, 
we established that results in [BGS10] do indeed provide infinite hierarchies of new properties not known 
to have been testable before. 

Our work raises many interesting questions which we believe merit further study. Perhaps the most 
obvious open problem is in what generality our method of characterizing matroid freeness properties in terms 
of labeled homomorphisms can be made to work, and in particular whether it can be extended to arbitrary 
labeled graphic matroids, or even arbitrary linear matroids. That is, is it always true that $(M, \sigma)$-freeness is
contained in $(N, \tau)$-freeness if and only if there is a labeled matroid homomorphism from $(M, \sigma)$ to $(N, \tau)$,
and that the two properties must be well separated otherwise? As was explained in Section 2, complete
graphic matroids $(M(K_d), \sigma)$ can be seen to be building blocks for all labeled graph matroid freeness
properties. Thus, a first step towards the resolution of this question might be to understand $(M(K_d), \sigma)$-freeness
for any pattern $\sigma$, and then study how intersections of such properties behave.

Leaving aside the issue of labeled matroid homomorphisms, another fundamental open problem is
whether there must always hold a dichotomy between containment and $\delta$-separation for matroid freeness
properties. If this turns out to be the case, the next question is whether such a dichotomy would extend
even further to arbitrary linear-invariant properties.

At the core of all this is the problem of determining when $(M, \sigma)$-freeness and $(N, \tau)$-freeness are
identical properties for two syntactically different labeled matroids $(M, \sigma)$ and $(N, \tau)$. If we look at graphic
matroids, one observation is that blowing up the underlying graph does not change the property. Formally,
for a graph $G$ and for a positive integer $t$, define the order-$t$ blowup of $G$ to be the graph $G^{(t)}$ obtained by
replacing each vertex of $G$ by an independent set of size $t$ and each edge in $G$ by the complete bipartite
graph $K_{t,t}$. Furthermore, if an edge of $G$ is labeled by an element of $\{0, 1\}$, then use that label for all edges
in the associated complete bipartite graph in the blowup graph; if $\sigma$ was the original labeling for the edges,
we call the new labeling for the blowup graph $\sigma^{(t)}$.
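The blowup construction can be made concrete with a short Python sketch. It assumes a simple graph given as a list of vertex pairs with one label per edge; the function name `blowup` and the returned `origin` list (which records, for each new edge, the index of the edge of $G$ it came from) are illustrative choices rather than notation from the paper.

```python
from itertools import product

def blowup(edges, labels, t):
    """Order-t blowup of a labeled simple graph.
    Vertex v becomes the copies (v, 0), ..., (v, t-1); edge {u, v} becomes
    the complete bipartite graph K_{t,t} between the copies of u and of v,
    and every new edge inherits the label of the edge it came from."""
    new_edges, new_labels, origin = [], [], []
    for idx, ((u, v), lab) in enumerate(zip(edges, labels)):
        for i, j in product(range(t), repeat=2):
            new_edges.append(((u, i), (v, j)))
            new_labels.append(lab)
            origin.append(idx)
    return new_edges, new_labels, origin

# Example: a labeled triangle blown up with t = 2.
tri_edges = [(0, 1), (1, 2), (0, 2)]
tri_labels = [0, 1, 1]
b_edges, b_labels, b_origin = blowup(tri_edges, tri_labels, 2)
print(len(b_edges))  # each of the 3 edges contributes t*t = 4 edges
```

Copies of the same vertex are never joined, so each vertex class of the blowup is an independent set of size $t$.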

The fact that graph blow-ups preserve matroid freeness properties, which we write down as Proposition
6.1 below, is similar in flavor to the Erdős-Stone theorem from extremal graph theory. The Erdős-Stone
theorem essentially says that for any graph $G$ and integer $t \ge 1$, $G$-freeness (i.e., not containing $G$ as an
induced subgraph) and $G^{(t)}$-freeness are not $\delta$-separated for any constant $\delta > 0$; see, for example, [Die05].
However, the proof of the analogous statement in the matroid freeness case turns out to be much simpler, 
because a matroid homomorphism is not required to be injective whereas the subgraph relationship in graphs 
is an injection. 

Proposition 6.1. Given a graph $G$ on $m$ edges and a string $\sigma \in \{0, 1\}^m$, suppose $H$ is a subgraph of $G^{(t)}$
for some $t \ge 1$ that contains at least one copy of $G$. Also, suppose $\tau$ is the restriction of $\sigma^{(t)}$ to the edges of
$H$. Then, $(M(G), \sigma)$-freeness is identical to $(M(H), \tau)$-freeness.

The proof is straightforward. The fact that $(M(G), \sigma)$-freeness is contained in $(M(H), \tau)$-freeness
follows from Corollary 3.2. The other direction holds because the map that takes each edge of $H$ to the edge
of $G$ from where it originated is a labeled matroid homomorphism from $(M(H), \tau)$ to $(M(G), \sigma)$, and so
we can apply Lemma 3.1.

It should be noted that Proposition 6.1 is not a characterization of equality even for monotone graphic
matroid freeness properties. For instance, while $K_4$ is easily seen not to be a subgraph of any blowup of $K_3$,
it nevertheless holds that $(M(K_4), 1^*)$-freeness and $(M(K_3), 1^*)$-freeness are identical properties as shown
in Proposition 3.3. What our dichotomy theorems in Section 3.3 establish is that for all monotone properties
and a nontrivial subclass of non-monotone properties, equality of properties corresponds exactly to existence
of matroid homomorphisms in both directions. As noted above, the question whether such a correspondence
holds in general for any non-monotone matroid freeness properties remains wide open.

A Matroids, Matroid Freeness and Systems of Linear Equations

Let us start this appendix by giving a formal definition of what a matroid is for completeness. There are 
many equivalent ways to define a matroid, and for some of the different formulations it is in fact nontrivial to 
show that they are equivalent. We will use the definition presented next, and refer the reader to, for instance, 
[Oxl03, Wil73] for more background on matroid theory. 

Definition A.1 (Matroid). A matroid $M$ is a finite set $S$, along with a set $\mathcal{I}$ of subsets of $S$, such that:

1. The empty set is in $\mathcal{I}$.

2. If $X$ is in $\mathcal{I}$, then every subset of $X$ is also in $\mathcal{I}$.

3. If $X$ and $Y$ are both in $\mathcal{I}$ and $|X| = |Y| + 1$, then there exists an element $x \in X \setminus Y$ such that $Y \cup \{x\}$
is in $\mathcal{I}$.

The set $S$ is called the ground set of $M$, and the set $\mathcal{I}$ is the collection of independent sets of $M$. Those
subsets of $S$ which are not in $\mathcal{I}$ are called dependent. A maximal independent set (that is, an independent
set $X$ which becomes dependent on adding any element of $S$ not already in $X$) is called a basis for the matroid. It is a
basic result of matroid theory that any two bases of a matroid $M$ must have the same number of elements.
This number is called the rank of $M$.
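For small ground sets the three axioms of Definition A.1 can be verified exhaustively. The following Python sketch does this directly from the definition; the function names `is_matroid` and `bases` are illustrative, and the two test families (the independent sets of the uniform matroid $U_{2,3}$ and a family that violates the exchange axiom) are chosen here only as examples.

```python
from itertools import combinations

def is_matroid(ground, independent):
    """Check the three axioms of Definition A.1 by brute force."""
    ground = frozenset(ground)
    indep = {frozenset(s) for s in independent}
    if not all(X <= ground for X in indep):
        return False
    # Axiom 1: the empty set is independent.
    if frozenset() not in indep:
        return False
    # Axiom 2: every subset of an independent set is independent.
    for X in indep:
        for r in range(len(X)):
            if any(frozenset(sub) not in indep for sub in combinations(X, r)):
                return False
    # Axiom 3: exchange property for |X| = |Y| + 1.
    for X in indep:
        for Y in indep:
            if len(X) == len(Y) + 1:
                if not any((Y | {x}) in indep for x in X - Y):
                    return False
    return True

def bases(independent):
    """Independent sets that are maximal with respect to inclusion."""
    indep = {frozenset(s) for s in independent}
    return [X for X in indep if not any(X < Y for Y in indep)]

# Uniform matroid U_{2,3}: all subsets of {1, 2, 3} of size at most 2.
u23 = [(), (1,), (2,), (3,), (1, 2), (1, 3), (2, 3)]
# Not a matroid: {3} cannot be extended by an element of {1, 2}.
bad = [(), (1,), (2,), (3,), (1, 2)]

print(is_matroid({1, 2, 3}, u23), is_matroid({1, 2, 3}, bad))
print(sorted(len(b) for b in bases(u23)))
```

In the first example all bases have the same size, in line with the statement that the rank is well defined.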

Two important classes of matroids, which we briefly discuss next, are linear matroids and graphic 
matroids. 

We say that a matroid $M$ on a ground set $S = \{x_1, \ldots, x_k\}$ is a linear matroid, or vector matroid, if there
is a field $\mathbb{F}$ and vectors $v_1, \ldots, v_k$ in $\mathbb{F}^k$ such that any subset $\{x_i \mid i \in T\}$ indexed by $T \subseteq [k]$ is independent
if and only if the corresponding vectors $\{v_i \mid i \in T\}$ form a linearly independent set. A matroid is binary if
it is linear with $\mathbb{F} = \mathbb{F}_2$. Note that when we are interested in property testing of linear-invariant properties,
the only matroid freeness properties that really make sense to consider are those of linear matroids.

Given a graph $G$, we can let $S$ be the set of edges $E(G)$ of $G$ and $\mathcal{I}$ consist of the subsets of $S = E(G)$
that do not contain any cycles in the graph. Then $M = (S, \mathcal{I})$ can be shown to be a matroid, which we refer
to as the graphic matroid $M(G)$ over $G$. Any graphic matroid $M(G)$ can be represented as a binary matroid.
One way of seeing this is to consider the incidence matrix of $G$ and let $v_i$ be the rows corresponding to the
edges. Then any cycle in $G$ will correspond to a subset of vectors summing to zero. Another possibility is
to fix any spanning tree $T$ of $G$ and let the edges $e_1, e_2, \ldots$ in this spanning tree correspond to unit vectors
$\mathbf{e}_1, \mathbf{e}_2, \ldots$. Then any edge $e$ not in the spanning tree $T$ will correspond to the sum of the vectors for the
unique minimal set of edges in $T$ that together with $e$ yields a cycle.
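The spanning-tree representation can be sanity-checked computationally for complete graphs. The Python sketch below takes the spanning tree of $K_d$ to be the star at vertex 0, encodes vectors over $\mathbb{F}_2$ as integer bitmasks, and verifies that a set of edges is cycle-free exactly when the corresponding vectors are linearly independent. All function names are illustrative.

```python
from itertools import combinations

def star_representation(d):
    """Edges of K_d (vertices 0..d-1) as bitmask vectors in F_2^(d-1).
    Tree edge (0, i) gets the unit vector e_i; edge (i, j) gets e_i + e_j."""
    rep = {}
    for i in range(1, d):
        rep[(0, i)] = 1 << (i - 1)
    for i, j in combinations(range(1, d), 2):
        rep[(i, j)] = (1 << (i - 1)) ^ (1 << (j - 1))
    return rep

def gf2_rank(vectors):
    """Rank over F_2 by Gaussian elimination, keyed on the highest set bit."""
    pivots = {}
    for v in vectors:
        while v:
            h = v.bit_length() - 1
            if h not in pivots:
                pivots[h] = v
                break
            v ^= pivots[h]
    return len(pivots)

def is_forest(edges, d):
    """True if the edge set contains no cycle (union-find)."""
    parent = list(range(d))
    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x
    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru == rv:
            return False
        parent[ru] = rv
    return True

def check(d):
    """Compare graphic and linear independence on every edge subset of K_d."""
    rep = star_representation(d)
    edges = list(rep)
    for r in range(len(edges) + 1):
        for subset in combinations(edges, r):
            independent = gf2_rank([rep[e] for e in subset]) == len(subset)
            if independent != is_forest(subset, d):
                return False
    return True

print(check(4))
```

The exhaustive comparison enumerates all edge subsets, so it is practical only for small $d$.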

As the reader can see, in this paper we used the latter approach with spanning trees emanating from a
single, unique vertex to get our standard representation for $M(K_d)$ (Definition 2.6). Another possibility
would have been to use the incidence matrix representation. Note that this would have given a very nice
and symmetric representation with all vectors in $M(K_d)$ having Hamming weight 2, and with a basis corresponding
to fixing some coordinate $j$ and requiring that all the basis vectors have a 1 in this coordinate.
Of course, our standard representation is just taking such a basis and "puncturing" it by deleting the $j$th
coordinate from all vectors. For our purposes, it somehow turned out that it was very convenient to have a
representation where all basis vectors had weight 1 and all other vectors had weight 2. However, we just
want to point out that this is not the only way of thinking about $M(K_d)$, and that it might be interesting
when trying to generalize our results to investigate whether the representation with all vectors of uniform
weight 2 might be a more fruitful way of looking at $M(K_d)$.

(a) $M(K_4) = \{\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3, \mathbf{e}_1 + \mathbf{e}_2, \mathbf{e}_1 + \mathbf{e}_3, \mathbf{e}_2 + \mathbf{e}_3\}$. (b) Matrix encoding $v_4 = \mathbf{e}_1 + \mathbf{e}_2$, $v_5 = \mathbf{e}_1 + \mathbf{e}_3$, and $v_6 = \mathbf{e}_2 + \mathbf{e}_3$.

Figure 3: The binary graphic matroid $M(K_4)$ and the corresponding linear equation system matrix.

Let us now explain how one can see that the matroid-freeness representation of properties employed 
in [BCSX09] and the current paper on the one hand, and the system of linear equations representation of 
properties used in [KSV09, Sha09, BGS10] on the other, are essentially equivalent. This equivalence is in 
some sense folklore knowledge, but since it is not entirely obvious a priori, and since we have not seen it 
actually written down anywhere, we give an explicit exposition of the correspondence here for completeness. 
Note, however, that we will only discuss monotone properties below. Non-monotone properties have also 
been formulated using systems of linear equations, in this context most notably in [BGS10], but since the 
notation is a bit heavier and the ideas are essentially the same, we ignore the issue of non-monotonicity in 
this appendix for the sake of simplicity. 

The system of linear equations representation is the following. Let $\mathbb{K}$ be a field. Let $k$ and $\ell$ be fixed
integers with $k < \ell$. Let $Ax = b$ be a system of $k$ linear equations in $\ell$ variables, where $A \in \mathbb{K}^{k \times \ell}$ and
$b \in \mathbb{K}^k$. We say a set $S \subseteq \mathbb{K}$ is $(A, b)$-free if it contains no solution to $Ax = b$; that is, $S$ is $(A, b)$-free if
there is no vector $x \in S^\ell$ that satisfies all of the $k$ equations in $Ax = b$.
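As a concrete illustration of this definition, the Python sketch below tests $(A, b)$-freeness by exhaustive search in a restricted setting chosen here for simplicity: the entries of $A$ are taken from $\mathbb{F}_2$, $b = 0$, and the elements of $S$ are points of $\mathbb{F}_2^n$ encoded as integer bitmasks, so that addition is XOR. The function name `is_equation_free` is hypothetical.

```python
from itertools import product

def is_equation_free(S, A):
    """True if no tuple (x_1, ..., x_l) in S^l satisfies A x = 0.
    Assumptions: A has 0/1 entries, b = 0, and each x_j is a point of
    F_2^n stored as an int bitmask (vector addition is XOR)."""
    S = list(S)
    ell = len(A[0])
    for xs in product(S, repeat=ell):      # all of S^l, repetitions allowed
        solves_all = True
        for row in A:
            acc = 0
            for a, x in zip(row, xs):
                if a:
                    acc ^= x
            if acc != 0:
                solves_all = False
                break
        if solves_all:
            return False
    return True

A = [[1, 1, 1]]                       # single equation x_1 + x_2 + x_3 = 0
S_bad = {0b01, 0b10, 0b11}            # 01 + 10 + 11 = 0, so a solution exists
S_good = {0b001, 0b010, 0b100}        # linearly independent points

print(is_equation_free(S_bad, A), is_equation_free(S_good, A))
```

Since the search ranges over all of $S^\ell$, repeated entries are allowed; in particular any set containing the zero vector fails to be free for a homogeneous system.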

When considered as a property testing problem, we usually first pick a finite field $\mathbb{F}$ and then take
$\mathbb{K} = \mathbb{F}^n$. For properties that arise naturally in mathematics and computer science, it is usually the case that
the property can be specified uniformly for all $n$ using a finite description. Thus, it is of particular interest
to consider the case when $A$ and $b$ have finite descriptions, and $A$ is therefore usually taken as having entries over
$\mathbb{F}$, not $\mathbb{K}$. Furthermore, in order for the properties to be linear-invariant we take $b = 0$. (We note, however,
that the results in [KSV09, Sha09] also hold when we can pick $A$ (non-uniformly) as any matrix in $\mathbb{K}^{k \times \ell}$
and for any $\mathbb{K}$, not just $\mathbb{K} = \mathbb{F}^n$, and for $b \neq 0$.)

As a first simple example, the system of linear equations representation corresponding to $(M(C_\ell), 1^*)$-freeness,
where $C_\ell$ is the cycle of length $\ell$, consists of just a single linear equation $\sum_{i=1}^{\ell} x_i = 0$. Therefore
we have $A = [1\ 1\ 1\ \cdots\ 1]$ and $b = 0$, encoding that the sum of $\ell$ vectors is zero. Another simple, but less
trivial, example is that of $(K_4, 1^*)$-freeness. Figure 3 shows the graphic matroid $M(K_4)$ and the matrix $A$
of its corresponding representation as a system of linear equations.

Let us now consider $(M, 1^*)$-freeness for a general linear matroid $M = \{v_1, \ldots, v_{\ell-k}, v_{\ell-k+1}, \ldots, v_\ell\}$,
where $\{v_1, \ldots, v_{\ell-k}\}$ form a basis for the matroid and each of the vectors in $\{v_{\ell-k+1}, \ldots, v_\ell\}$ can be
written as a linear combination of the first $\ell - k$ vectors $v_{\ell-k+i} = -\sum_{j=1}^{\ell-k} B_{ij} v_j$ with coefficients $B_{ij} \in \mathbb{F}$.

Without loss of generality, we can think of the first $\ell - k$ vectors as being $v_1 = \mathbf{e}_1, \ldots, v_{\ell-k} = \mathbf{e}_{\ell-k}$. To
transform $(M, 1^*)$-freeness into a system of linear equations representation, we construct a matrix $A \in \mathbb{F}^{k \times \ell}$







in which the $i$th row consists of the coefficients of the linear equation describing $v_{\ell-k+i}$. Specifically, for
every $1 \le i \le k$ and $1 \le j \le \ell$, $A_{ij} = B_{ij}$ when $1 \le j \le \ell - k$; $A_{ij} = 1$ if $j = \ell - k + i$ and $A_{ij} = 0$
otherwise.
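The construction of $A$ from the coefficients $B_{ij}$ can be sketched in Python for the binary case, where signs vanish. The example uses the coefficient matrix of $M(K_4)$ read off from $v_4 = \mathbf{e}_1 + \mathbf{e}_2$, $v_5 = \mathbf{e}_1 + \mathbf{e}_3$, $v_6 = \mathbf{e}_2 + \mathbf{e}_3$; the function names are illustrative and the restriction to $\mathbb{F}_2$ is an assumption made for simplicity.

```python
def equations_from_matroid(B):
    """Build A = [B | I_k] over F_2 from the k x (l-k) coefficient matrix B:
    A[i][j] = B[i][j] for the basis columns, and a single 1 in column l-k+i."""
    k = len(B)
    return [list(row) + [1 if c == i else 0 for c in range(k)]
            for i, row in enumerate(B)]

def matroid_vectors(B):
    """Unit vectors e_1..e_{l-k}, followed by v_{l-k+i} = sum_j B[i][j] e_j
    (over F_2 the minus sign in the linear combination disappears)."""
    m = len(B[0])
    units = [[1 if t == j else 0 for t in range(m)] for j in range(m)]
    return units + [list(row) for row in B]

def satisfies(A, vectors):
    """Check that every row of A sends the vector tuple to zero over F_2."""
    m = len(vectors[0])
    for row in A:
        total = [0] * m
        for a, v in zip(row, vectors):
            if a:
                total = [(s + x) % 2 for s, x in zip(total, v)]
        if any(total):
            return False
    return True

B_K4 = [[1, 1, 0],   # v_4 = e_1 + e_2
        [1, 0, 1],   # v_5 = e_1 + e_3
        [0, 1, 1]]   # v_6 = e_2 + e_3
A_K4 = equations_from_matroid(B_K4)
for row in A_K4:
    print(row)
print(satisfies(A_K4, matroid_vectors(B_K4)))
```

The final check confirms that the vectors of the matroid themselves form a solution of the resulting system, which is the sense in which the matrix encodes the matroid.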

To go in the other direction and transform a system of linear equations into matroid freeness representation,
we may assume without loss of generality that the matrix $A$ has rank $k$ (otherwise we may delete the
redundant rows). Then by permuting the columns of $A$ and appropriately changing the basis of $\mathbb{K}$, we can
transform $A$ into the form $[B \mid I_k]$, where $I_k$ is a $k$-by-$k$ identity matrix and $B$ is a $k$-by-$(\ell - k)$ matrix. Now
$B = \{B_{ij}\}$ is exactly the matrix we defined above which contains the coefficients of $k$ linear combinations
of the non-basis vectors in the matroid in terms of the $\ell - k$ basis vectors, so it is immediate to recover the
matroid freeness representation from this matrix.
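One way to carry out this normalization in the binary case is Gauss-Jordan elimination followed by a column permutation that moves the pivot columns to the end. The Python sketch below assumes a 0/1 matrix of full row rank over $\mathbb{F}_2$ and uses row operations in place of the change of basis mentioned above; the function name `to_B_identity` and the returned column order are illustrative.

```python
def to_B_identity(A):
    """Bring a full-row-rank 0/1 matrix A (over F_2) to the form [B | I_k].
    Returns (B, order), where order lists the original column indices:
    first the non-pivot columns (forming B), then the pivot columns."""
    rows = [list(r) for r in A]
    k, ell = len(rows), len(rows[0])
    pivots = []
    r = 0
    for c in range(ell):
        # Find a row at or below r with a 1 in column c.
        p = next((i for i in range(r, k) if rows[i][c]), None)
        if p is None:
            continue
        rows[r], rows[p] = rows[p], rows[r]
        # Clear column c in every other row (row operations keep the solution set).
        for i in range(k):
            if i != r and rows[i][c]:
                rows[i] = [a ^ b for a, b in zip(rows[i], rows[r])]
        pivots.append(c)
        r += 1
        if r == k:
            break
    if r < k:
        raise ValueError("A must have rank k; delete redundant rows first")
    free = [c for c in range(ell) if c not in pivots]
    B = [[row[c] for c in free] for row in rows]
    return B, free + pivots

A = [[1, 1, 0, 1, 0, 0],
     [1, 0, 1, 0, 1, 0],
     [0, 1, 1, 0, 0, 1]]
B, order = to_B_identity(A)
print(B, order)
```

Permuting columns only renames the variables of the system, so the variables indexed by the non-pivot columns play the role of the basis vectors and each row of $B$ describes one non-basis vector.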